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Abstract 

In these notes, we present some methods and applications of large deviations to 
finance and insurance. We begin with the classical ruin problem related to the Cramer's 
theorem and give en extension to an insurance model with investment in stock market. 
We then describe how large deviation approximation and importance sampling are 
used in rare event simulation for option pricing. We finally focus on large deviations 
methods in risk management for the estimation of large portfolio losses in credit risk 
and portfolio performance in market investment. 
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1 Introduction 



The area of large deviations is a set of asymptotic results on rare event probabilities and 
a set of methods to derive such results. Large deviations is a very active area in applied 
probability, and questions related to extremal events both in finance and insurance appli- 
cations, play an increasingly important role. For instance, recent applications of interest 
concern ruin probabilities in risk theory, risk management, option pricing, and simulation 
of quantities involved in this context. 

Large deviations appear historically in insurance mathematics with the ruin probability 
estimation problem within the classical Cramer-Lundberg model. The problem was then 
subsequently extended to more general models involving for example Levy risk processes. 
In finance, large deviations arise in various contexts. They occur in risk management for 
the computation of the probability of large losses of a portfolio subject to market risk or 
the default probabilities of a portfolio under credit risk. Large deviations methods are 
largely used in rare events simulation and so appear naturally in the approximation of 
option pricing, in particular for barrier options and far from the money options. 

We illustrate our purpose with the following toy example. Let X be a (real-valued) 
random variable, and consider the problem of computing or estimating P[X > £], the 
probability that X exceeds some level t. In finance, we may think of X as the loss of a 
portfolio subject to market or credit risk, and we are interested in the probability of large 
loss or default probability. The r.v. X may also correspond to the terminal value of a stock 
price, and the quantity F[X > £] appears typically in the computation of a call or barrier 
option, with a small probability of payoff when the option is far from the money or the 
barrier I is large. To estimate p = F[X > £], a basic technique is Monte Carlo simulation : 
generate n independent copies X\, . . . , X n of X, and use the sample mean : 

1 n 

S n = -y~) Y u with Yi = l Xi> e- 
t=i 

The convergence of this estimate (when n — > oo) follows from the law of large numbers, 
while the standard rate of convergence is given, via the central limit theorem, in terms of 
the variance v = p(l — p) of Yi : 

F[\S n -p\>^=] - 2*( 

where $ is the cumulative distribution function of the standard normal law. Furthermore, 
the convergence of the estimator S n is precised with the large deviation result, known here 
as the Cramer's theorem, which is concerned with approximation of rare event probabilities 
f[S n € A], and typically states that 

P[|5„-p|>a] ~ Ce^ n , 

for some constants C and 7. 

Let us now turn again to the estimation of p = P[X > £]. As mentioned above, the rate 
of convergence of the naive estimator S n is determined by : 

Var(^) = YaX{lx > e) = P{1 ~ P \ 
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and the relative error is 



, standard deviation of S n ypiX — p) 

relative error = = = — — — . 

mean of S n y/np 



Hence, if p = F[X > £] is small, and since \Jp — p 2 /p — > oo as p goes to zero, we see that 
a large sample size (i.e. n) is required for the estimator to achieve a reasonable relative 
error bound. This is a common occurence when estimating rare events. In order to improve 
the estimate of the tail probability ¥[X > £], one is tempted to use importance sampling 
to reduce variance, and hence speed up the computation by requiring fewer samples. This 
consists basically in changing measures to try to give more weight to "important" outcomes, 
(increase the default probability). Since large deviations theory also deals with rare events, 
we can see its strong link with importance sampling. 

To make the idea concrete, consider again the problem of estimating p = ¥[X > £], and 
suppose that X has distribution fi(dx). Let us look at an alternative sampling distribution 
v{dx) absolutely continuous with respect to fj,, with density f(x) = di//d(i(x). The tail 
probability can then be written as : 

p = F[X > £] = J l x>e n(dx) = J l x>e <f>(x)v(dx) = E u [l x>e cf>(X)], 

where 4> = 1/f, and E„ denotes the expectation under the measure v. By generating 
i.i.d. samples X±, . . . , X n , . . . with distribution u, we have then an alternative unbiased and 
convergent estimate of p with 

n 



n . 
t=i 



and whose rate of convergence is determined by 

Var^n) = (l x>e -pf(x)) 2 4> 2 (x)v(dx). 

The minimization of this quantity over all possible v (or /) leads to a zero variance with 
the choice of a density f(x) = l x >e/p- This is of course only a theoretical result since it 
requires the knowledge of p, the very thing we want to estimate! However, by noting that 
in this case v(dx) = f(x)fi(dx) = l x> i/i(dx) /F[X > £] is nothing else than the conditional 
distribution of X given {X > £}, this suggests to use an importance sampling change of 
measure that makes the rare event {X > £} more likely. This method of suitable change of 
measure is also the key step in proving large deviation results. 

The plan of these lectures is the following. In Section 2, we give some basic tools and re- 
sults on large deviations, in particular the most classical result on large deviations, Cramer's 
theorem. We illustrate in Section 3 the first applications of large deviations to ruin prob- 
lems in insurance industry, and give some extension to an insurance model with financial 
investment opportunity. Section 4 is concerned with the large deviations approximation for 
rare event simulation in option pricing, and we shall use asymptotic results from large de- 
viations theory : Fredilin-Wentzell theory on sample path large deviations, and Varadhan's 
integral principle. Finally, Section 5 is devoted to applications of large deviations in risk 
management, where we use conditional and control variants of the Ellis-Gartner theorem. 
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2 Basic tools and results on large deviations 



2.1 Laplace function and exponential change of measures 



If X is a (real- valued) random variable on (0, T) with probability distribution n(dx), the 
cumulant generating function (c.g.f.) of \i is the logarithm of the Laplace function of X, 
i.e. : 



r(0) 



\nE[e ex ] = In / e 6x ^(dx) £ (-00,00], 8 £ 



Notice that T(0) = 0, and F is convex by Holder inequality. We denote T>(T) = {9 G 
T(9) < 00}, and for any 9 £ f (r), we define a probability measure \i$ on R by : 



fig(dx) 



ex.p(9x-T(0))n(dx). 



(2.1) 



Suppose that X\, . . . , X n , . . . , is an i.i.d. sequence of random variables with distribution \i 
and consider the new probability measure Eg on (Q, T) with likelihood ratio evaluated at 
(X 1 ,...,X n ),neW, by: 



dF e 

dP (Xl ' 



II^(^) = exp(^X,-nr(0) 



(2.2) 



By denoting E# the corresponding expectation under Pg, formula (|2.2p means that for all 

n E N* 



E 



f(X l: . 



f{X x , . . . , X n ) exp ( - 6 x i + nT(9) 



i=i 



(2.3) 



for all Borel functions / for which the expectation on the l.h.s. of (|2.3p is finite. Moreover, 
the random variables X\, . . . ,X n , n G N*, are i.i.d. with probability distribution under 
Eg. Actually, the relation (|2.3|) extends from a fixed number of steps n to a random number 
of steps, provided the random horizon is a stopping time. More precisely, if r is a stopping 
time in N for Xi, . . . ,X n , . . ., i.e. the event {r < n} is measurable with respect to the 
algebra generated by {X±, . . . , X n } for all re, then 



E 



f(X u ...,X T )l 



E 



f(X 1 ,...,X T )exp^-eY,Xi + TT(e))\, x .(2.1.) 

1=1 



for all Borel functions / for which the expectation on the l.h.s. of (j2.4j) is finite. 

The cumulant generating function T records some useful information on the probability 
distributions fig. For example, T'(9) is the mean of \iq. Indeed, for any 9 in the interior of 
2?(r), differentiation yields by dominated convergence : 



r'(9) 



E[Xe 



ex] 



E\e ex ] 



E[Xexp(9X -T(9))] = E e [X]. 



(2.5) 



A similar calculation shows that T"(9) is the variance of fj>$. Notice in particular that if 
lies in the interior of V(T), then r'(0) = ELY] and r"(0) = Var(X). 
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Bernoulli distribution 

Let fi the Bernoulli distribution of parameter p. Its c.g.f. is given by 



F(9) 



ln(l — p + pe ). 



A direct simple algebra calculation shows that fig is the Bernoulli distribution of parameter 
pe 9 /(l -p + pe 9 ). 

Poisson distribution 

Let fj, the Poisson distribution of intensity A. Its c.g.f. is given by 



A direct simple algebra calculation shows that fig is the Poisson distribution of intensity 
Xe s . Hence, the effect of the change of probability measure Fg is to multiply the intensity 
by a factor e e . 

Normal distribution 

Let n the normal distribution A/"(0, a 2 ), whose c.g.f. is given by : 



A direct simple algebra calculation shows that fig is the normal distribution J\f(9a 2 , a 2 ). 
Hence, if Xi, . . . , X n are i.i.d. with normal distribution jV(0, a 2 ), then under the change of 
measure Fg with likelihood ratio : 



the random variables Xi, . . . , X n are i.i.d. with normal distribution J\f(6a 2 , a 2 ) : the effect 
of Fg is to change the mean of Xi from to 6a 2 . This result can be interpreted as the 
finite-dimensional version of Girsanov's theorem. 

Exponential distribution 

Let \x the exponential distribution of intensity A. Its c.g.f. is given by 



A direct simple algebra calculation shows that for 6 < A, \x$ is the exponential distribution 
of intensity A — 6. Hence, the effect of the change of probability measure Fg is to shift the 
intensity from A to A — 6. 

2.2 Cramer's theorem 

The most classical result in large deviations area is Cramer's theorem. This concerns large 
deviations associated with the empirical mean of i.i.d. random variables valued in a finite- 
dimensional space. We do not state the Cramer's theorem in whole generality. Our purpose 



T(9) = A(e*-1). 




i=l 



F(9) 




6<\ 
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is to put emphasis on the methods used to derive such result. For simplicity, we consider the 
case of real-valued i.i.d. random variables Xi with (nondegenerate) probability distribution 
[i of finite mean EXj = J xfi(dx) < oo, and we introduce the random walk S n = Y17=i -^i- 
It is well-known by the law of large numbers that the empirical mean S n /n converges in 
probability to x = EXl, i.e. lim n F[S n /n £ (x — e, x + e)] = 1 for all e > 0. Notice also, 
by the central limit theorem that lim n F[S n /n E [x,x + e)] = 1/2 for all e > 0. Large 
deviations results focus on asymptotics for probabilities of rare events, for example of the 
form P[^k > x] for x > EXi, and state that 

P[— > x] ~ Ce" 7:r , 
n 

for some constants C and 7 to be precised later. The symbol ~ means that the ratio is one 
in the limit (here when n goes to infinity) . The rate of convergence is characterized by the 
Fenchel-Legendre transform of the c.g.f. T of X\ : 

T*(x) = sup [9x - r(0)] G [0,oo], x£R. 



As supremum of afhne functions, T* is convex. The sup in the definition of T* can be 
evaluated by differentiation : for x £ R, if 9 = 9{x) is solution to the saddle-point equation, 
x = T'(9), then T*(x) = 9x — T(9). Notice, from (|2.5p . that the exponential change of 
measure Pg put the expectation of X\ to x. Actually, exponential change of measure is a 
key tool in large deviations methods. The idea is to select a measure under which the rare 
event is no longer rare, so that the rate of decrease of the original probability is given by 
the rate of decrease of the likelihood ratio. This particular change of measure is intended 
to approximate the most likely way for the rare event to occur. 

By Jensen's inequality, we show that T*(JsLX\) = 0. This implies that for all x > EXi, 
= sup0>o [9x — r(0)], and so T* is nondecreasing on \EXi, 00). 

Theorem 2.1 (Cramer's theorem) 
For any x > EXi, we have 

1 S 

lim -mP[— > x] = -T*(x) = -MT*(y). (2.6) 

rwoo n n y>x 

Proof. 1) Upper bound. The main step in the upper bound < of (|2.6p is based on Chebichev 
inequality combined with the i.i.d. assumption on the Xi : 

jp[^>x] = Eris,,^ ] < E[e 9( - Sn ' nx ^] = exp(nT(9)-9nx), V9 > 0. 

n n - 

By taking the infimum over 9 > 0, and since T*(x) = supq >0 [9x — T(9)] for x > EAi, we 
then obtain 

P[— >x] < exp(-nT*(x)). 
n 

and so in particular the upper bound < of (|2,6p . 
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2) Lower bound. Since 



> x > 



ft 



G [x, x + e)], for all e > 0, it suffices to show that 



lim lim inf — In I 



— G \x,x + e) 
n 



> 



-r(x). 



(2.7) 



Suppose that /i is supported on a bounded support so that T is finite everywhere. Suppose 
first that there exists a solution 9 = 9{x) > to the saddle-point equation : T'{9) = x, i.e. 
attaining the supremum in T*(x) = 6{x)x — T(9(x)). The key step is now to introduce the 
new probability distribution fig as in (12. ip and Fg the corresponding probability measure 
on (r^,^ 7 ) with likelihood ratio : 



dFg _ T~~r dflg . 
1=1 

Then, we have by (|2.3p and for all e > : 



exp (eSn-nT(9)). 



Sn _ r \ 

— G x, x + e) 



exp ( - + nT(9)\ Is^^^ 



c+e) 



-n(te-r(e)) E 



exp 



.5: 



c+e) 



> e _„ (ex _r(0)) e -n|0| £]] 



— G x, x + e) 
n 



and so 



-In] 

n 



— G[x,x + e)l > —[Ox — T(6)] — \9\e + — lnP# \— G [x, x + e) 



(2. 



Now, since r'(#) = x, we have Eg[Xi] = x, and by the law of large numbers and CLT : 



lim n P e 



^ G [x,x + e) = 1/2 (> 0). We also have T*(x) = 6x - T(9). Therefore, by 



sending n to infinity and then e to zero in (|2.8p . we get (|2.7|) . 

Now, if the supremum in T*(x) is not attained, we can find a sequence {9 k )k 
that 9 k x - T{9 k ) -> T*(x). Since E[e ek{Xl - x h Xl<x ] -> 0, we then get 



oo, such 



E[e e ^- x h Xl > x ] 



-r*(x) 



as k goes to infinity. This is possible only if F[Xi > x] = and F[X± = x] = e r *( x ). By 
the i.i.d. assumption on the X{, this implies F[S n /n > x] > (P[Xi > x]) n = e - nr *( x ) ^ which 
proves (p77|) . 

Suppose now that ,u is of unbounded support, and fix M large enough s.t. fi([—M,M]) 
> 0. By the preceding proof, the lower bound (|2.7p holds with the law of S n /n conditional 
on {\Xi\ < M,i = 1, . . . , n}, and with a c.g.f. equal to the c.g.f. of the conditional law of 
Xi given \Xi\ < M : 



> 



lim lim inf — In I 

e^0 n—*ao n 

-fl/(x) : = 



G x, x + e, 



\Xi\ <M,i = l,...,n 



sup[9x - f M (9)], 



(2.9) 
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with T M (9) = rnE[e^||Xi| < M] = T M {9) - ln//([-M, M]), T M {9) = lnE[e^l| Xl |< M ] 
Now, by writing from Bayes formula that 



S„ 



G x, x + e) 



*5 r ; 



G x, x + e. 



M,i = 1, .. . ,n 



. (/^([-M,M])) n , we get with ([M 



1 



lim lim inf — In I 

e^O n— »oc n 

> lim lim inf — In I 

e^O n^oo n 



L n 



G \x, x + e) 



> 



-Im(s) 



G x, x + el 

L n 

sup[0x - r M (6>)]. 



< M,i = 1, . . . ,ra 



+ ln/i([-M, M]) 



The required result is obtained by sending M to infinity. Notice also finally that inf y > x T* (y) 
= r*(x) since T* is nondecreasing on [EXi,oo). □ 



Examples 

1) Bernoulli distribution : for X\ ~ B(p), we have T*(x) = x In (p) + (1 — x) In (jE^) f° r £ 
G [0, 1] and oo otherwise. 

2) Poisson distribution : for X\ ~ V{\), we have T*(x) = xln (^) + A — x for x > and oo 
otherwise. 

3) Normal distribution : for Xi ~ Af(0,a 2 ), we have = ^, i £ R. 

2) Exponential distribution : for X\ ~ £ (A), we have T*(x) = Ax — 1 — ln(Ax) for x > 
and T*(x) = oo otherwise. 

Remark 2.1 Cramer's theorem possesses a multivariate counterpart dealing with the large 
deviations of the empirical means of i.i.d. random vectors in W 1 . 

Remark 2.2 (Relation with importance sampling) 

Fix n and let us consider the estimation of p n = ¥[S n /n > x]. A standard estimator for 
p n is the average with N independent copies of X = l Sn / n>x . However, as shown in the 
introduction, for large n, p n is small, and the relative error of this estimator is large. By 
using an exponential change of measure Fg with likelihood ratio 



so that 



Pn 



exp (eS n -nT(6)), 



exp ( - 9S n + nT(9))ls IL>x 



we have an importance sampling (IS) (unbiased) estimator of p n , by taking the average of 
independent replications of 



exp ( - 9S n + nT(9))lsn 



>x- 



The parameter 9 is chosen in order to minimize the variance of this estimator, or equivalently 
its second moment : 



Ml(9,x) 



E 6 



exp (-20S„ + 2711X0))!^ 



>x 



< exp(-2n(0x-T(0))) 



(2.10) 
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By noting from Cauchy-Schwarz's inequality that M%(9, x) > p\ = P[S n /n > x] ~ Ce~ 2nr *^ 
as n goes to infinity, from Cramer's theorem, we see that the fastest possible exponential 
rate of decay of M%(9,x) is twice the rate of the probability itself, i.e. 2T*(x). Hence, 
from (|2.10p . and with the choice of 9 = 9 X s.t. T*{x) = 9 x x — T(9 X ), we get an asymptotic 
optimal IS estimator in the sense that : 

lim — In M^(9 x ,x) = 2 lim — \np n . 

n— »oo n " n^oo n 

This parameter 9 X is such that ¥.g x [S n /n] = x so that the event {S n /n > x} is no more 
rare under ¥g x , and is precisely the parameter used in the derivation of the large deviations 
result in Cramer's theorem. 

2.3 Some general principles in large deviations 

In this section, we give some general principles in large deviations theory. We refer to the 
classical references |13j or [15] for a detailed treatment on the subject. 

We first give the formal definition of a large deviation principle (LDP). Consider a 
sequence {Z £ } e on (fl,^ 7 , P) valued in some topological space X. The LDP characterizes 
the limiting behaviour as e — > of the family of probability measures {P[Z e G dx]} e on X 
in terms of a rate function. A rate function / is a lower semicontinuous function mapping 
I : X — ► [0, oo]. It is a good rate function if the level sets {x £ X : I(x) < M} are compact 
for all M < oo. 

The sequence {Z £ } E satisfies a LDP on X with rate function / (and speed e) if : 

(i) Upper bound : for any closed subset F of X 

limsupelnP[Z e G F] < - inf I(x). 

(ii) Lower bound : for any open subset G of X 

liminf elnP[Z e G Gl > - inf I(x). 

e^O x£G 
If F is a subset of X s.t. inf^g^o J(x) = inf^g^./^) := Ip, then 

lim elnP[Z £ G F] = -I F , 

which formally means that P[Z e G F] ~ Ce-^/ E for some constant C. The classical 
Cramer's theorem considered the case of the empirical mean Z £ = S n /n of i.i.d. ran- 
dom variables in R rf , with e = 1/n. Further main results in large deviations theory are 
the Gartner-Ellis theorem, which is a version of Cramer's theorem where independence is 
weakened to the existence of 

r(0) := limelnEfe 2 ^], 9 G M d . 

LDP is then stated for the sequence {Z £ } £ with a rate function equal to the Fenchel- 
Legendre transform of T : 

T*(x) = sup[0.a>-r(0)], xei d . 
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Other results in large deviations theory include Sanov's theorem, which gives rare events 
asymptotics for empirical distributions. In many problems, the interest is in rare events 
that depend on random process, and the corresponding asymptotics probabilities, usually 
called sample path large deviations, were developed by Freidlin-Wentzell and Donsker- 
Varadhan. For instance, the problem of diffusion exit from a domain is an important 
application of Freidlin-Wentzell theory, and occurs naturally in finance, see Section 14.11 
We briefly summarize these results. Let e > a (small) positive parameter and consider 
the stochastic differential equation in M d on some interval [0,T], 

dX e s = b £ (s,X £ s )ds + ^ea(s,X £ s )dW s , (2.11) 

and suppose that there exists a Lipschitz function b on [0, T] x M d s.t. 

lim b F = b, 

uniformly on compact sets. Given an open set T of [0, T) x R d , we consider the exit time 
from r, 

rl = inf{ S >t :.\V '-'' (••!•}. 
and the corresponding exit probability 

v e (t,x) = F[tI < T], (t,x) E [0,T] x R d . 

Here X e ' t,x denotes the solution to (|2.1ip starting from x at time t. It is well-known that 
the process X e,t,x converge to X°' t,x the solution to the ordinary differential equation 

dX° = b{s,X°)ds, X? = x. 

In order to ensure that v e goes to zero, we assume that for all t E [0,T], 

(H) ier ^ X°/' X £T, VsE[t,T]. 

Indeed, under (H), the system ()2.11j) tends, when e is small, to stay inside T, so that the 
event {rf < T} is rare. The large deviations asymptotics of v e (t,x), when e goes to zero, 
was initiated by Varadhan and Freidlin-Wentzell by probabilistic arguments. An alternative 
approach, introduced by Fleming, connects this theory with optimal control and Bellman 
equation, and is developed within the theory of viscosity solutions, see e.g. [Sj. We sketch 
here this approach. It is well-known that the function v e satisfies the linear PDE 

-^- + b e (t,x).D x v £ + -tr(aa'(t,x)D 2 x v £ ) = 0, (t, x) E [0, T) x T (2.12) 

together with the boundary conditions 

v e (t,x) = 1, (t, x) E [0, T) x dT (2.13) 
v e (T,x) = 0, x£T. (2.14) 
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Here dT is the boundary of T. We now make the logarithm transformation 

V £ = -eh\v £ . 

Then, after some straightforward derivation, (|2.12|) becomes the nonlinear PDE 

~ -Qf-b e (t,x).D x V e - £ -tx{aa'(t,x)DlV £ ) 

+ ^(D x V £ )'aa'(t,x)D x V £ = 0, (t, x) G [0, T) x T, (2.15) 

and the boundary data (|2.13|) - ()2.14|) become 

V £ (t,x) = 0, (t, x) G [0, T) x dT (2.16) 
V £ (T,x) = oo, x£T. (2.17) 

At the limit s = 0, the PDE (pTT5|) becomes a first-order PDE 

- ^ - 6(t, x).D x V Q + i(Dx^o)W(t, x)^^ = 0, (t, x) e [0, T) x T, (2.18) 

with the boundary data (|2.16p - (|2.17p . By PDE- viscosity solutions methods and comparison 
results, we can prove (see e.g. [9] or [19]) that V £ converges uniformly on compact subsets of 
[0, T) x T, as e goes to zero, to Vq the unique viscosity solution to (|2.18p with the boundary 
data (|2.16p - (|2.17p . Moreover, Vo has a representation in terms of control problem. Consider 
the Hamiltonian function 

H(t,x,p) = -b(t,x).p+ -p'aa'(t,x)p, (t, x,p) G [0, T] x T x R d , 

which is quadratic and in particular convex in p. Then, using the Legendre transform, we 
may rewrite 

H(t,x,p) = sup [- q.p- H*{t,x,q)}, 



where 

H*(t,x,q) = sup [-p.q- H(t,x,p)] 



^(q-b(t,x))\aa , (t,x))- 1 (q-b(t,x)), (t,x,q) G [0, T] x T x R d . 



Hence, the PDE (I2.18P is rewritten as 
dV 



, )t + inf j [q.D x V + H* {t, x, q)} = 0, {t, x) G [0, T) x T, 



which, together with the boundary data (|2.16p - (|2,17p . is associated to the value function 
for the following calculus of variations problem : for an absolutely continuous function x(.) 
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on [0, T) and valued in M. d , i.e. x E Hl oc ([0, T], we denote x(u) = q u its time derivative, 
and t(x) the exit time of x(.) from T. Then, 

V (t,x) = inf f H*(u,x(u),x(u))du, (t, x) G [0, T) X T, 

x(,)eA{t,x) J t 

f T 1 

= inf / -(x(u) — b(u,x(u)))'(aa'(u,x(u))) 1 (x(u) — b(u,x(u)))du 
x(.)eA(t,x) J t 2 

where 

A(t,x) = {x(.) £ Hl c {[0,T],R d ) : x{t) = x and t(x)<T}. 
The large deviations result is then stated as 

limelnw e (t,x) = -V (t,x), (2.19) 

and the above limit holds uniformly on compact subsets of [0, T) x T. A more precise 
result may be obtained, which allows to remove the above log estimate. This type of result 
is developed in [T7], and is called sharp large deviations estimate. It states asymptotic 
expansion (in e) of the exit probability for points (t, x) belonging to a set N of [0, T'\ x T 
for some T' < T, open in the relative topology, and s.t. Vq G C°°(N). Then, under the 
condition that 

b £ = 6 + e6i + 0(e 2 ), 

one has 

v e (t,x) = exp(-^^-^ 5 x))(l + 0( £ )), 
uniformly on compact sets of N, where w is solution to the PDE problem 

-^-{b-aa'D x V ).D x w = hiiaa'D 2 x V ) + h.D x V in N 



w(t,x) = on \[0,T) x dTj UN. 
The function w may be represented as 

w(t, x) = J" (~tr(aa'D 2 x V ) + h.D x V ) (s, £(s))ds, 

where ^ is the solution to 

£( 5 ) = {b-aa'D x V ){s,i{s)), t{t) = x, 

and p is the exit time (after t) of (s,£(s)) from N. 

We shall develop more in detail in the next sections some applications of the Gartner- 
Ellis and Freidlin-Wentzell theories in finance. 

We end this paragraph by stating the important Varadhan's integral formula, which in- 
volves the asymptotics behavior of certain expectations. It extends the well-known method 
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of Laplace for studying the asymptotics of certain integrals on R : given a continuous 
function ip from [0, 1] into R, Laplace's method states that 



1 f 1 

lim — In / e nip ( x >dx = max ip(x) 



Varadhan result's is formulated as follows : 

Theorem 2.2 (Varadhan) Suppose that {Z £ } e satisfies a LDP on X with good rate function 
I, and let ip : X — > R be any continuous function s.t. the following moment condition holds 
for some 7 > 1 : 

limsupelnE[e w(ze)/e ] < 00. 

e^O 



Then, 



limelnE[e^ (ze)/e ] = sup [<p(x) - I(x)}. (2.20) 



Proof, (a) For simplicity, we show the inequality < in (|2.20p when <p is bounded on X . 
Hence, there exists M 6 (0, 00) s.t. — M < <p{x) < M for all x £ X. For N positive integer, 
and j £ {1, . . . , N}, we consider the closed subsets of X 

F„, = {xeX: - M+ ^i< v(x) <-M + m h 

so that U^ 1 F/v,j = X. We then have from the large deviations upper bound on (Z £ ), 

limsupelnE[e v(Z£)/£ ] = lim sup e In / e^ (ze)/e P[Z £ £ dx] 

N 



< limsupeln(V I e^ 2 ^ 1 £ ¥[Z £ £ dx]) 

£^0 J Fm , 



N 



< lim sup £ In ( ]T e (-^+2iAf/7V)/ £p[z£ e F Nij ]) 



< limsupeln( max e { - M+2jMIN)/e ^[Z £ £ F N A) 

e^O j=l,...,N 

< max ir (-M + + lim sup e In F[Z £ £ F NJ ] 



2jM 

-t- 



< max f - M + ^4r" + sup \-I{x)\ 
j=l,...,N\ N TCP,,, 



/ 2jM 

< max - M + — — + sup [tp(x) - I(x)\ - inf ip{x t 

r / \ r , „ 2M 

< sup [tp(x) - I[x)\ + — . 
xeF Nj ^ 



By sending N to infinity, we get the inequality < in (12.201) . 
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(b) To prove the reverse inequality, we fix an arbitrary point xq £ X, an arbitrary 5 > 0, 
and we consider the open set G = {x £ X : ip(x) > <p(xq) — 5}. Then, we have from the 
large deviations lower bound on (Z £ ), 

liminfelnEfe^^] > liminf elnE[e^ ze)/e l zeeG ] 

> <p(xq) — 5 + liminf e In P[Z e £ G] 

> ip(xo) — 6 — inf I(x) 

x£G 

> <p(x ) - I(x ) - 6. 

Since xq G X and 5 > are arbitrary we get the required result. □ 

Remark 2.3 The relation (|2.20p has the following interpretation. By writing formally the 
LDP for (Z £ ) with rate function / as ¥[Z £ £ dx] ~ e~ I ^' e dx, we can write 

^ C cxp ( BUp x&x (<p(x) ~ I{X)) 

As in Laplace's method, Varadhan's formula states that to exponential order, the main 
contribution to the integral is due to the largest value of the exponent. 

3 Ruin probabilities in risk theory 

3.1 The classical ruin problem 
3.1.1 The insurance model 

We consider an insurance company earning premiums at a constant rate p per unit of time, 
and paying claims that arrive at the jumps of a Poisson process with intensity A. We 
denote by Nt the number of claims arriving in [0, t], by T n , n > 1, the arrival times of the 
claim, and by £i = T%, £ n = T n — T n _i, n > 2, the interarrival times, which are then i.i.d. 
exponentially distributed with finite mean E£\ = 1/A. The size of the n-th claim is denoted 
Y n , and we assume that the claim sizes Y n , n £ N*, are (positive) i.i.d., and independent 
of the Poisson process. Starting from an initial reserve x > 0, the risk reserve process Xt 
= Xf, t > 0, of the insurance company is then given by : 

Aft 

Xf = x + pt~Y, Y i- (3-1) 

i=l 

The probability of ruin with infinite horizon is 

t/j(x) = F[t x < oo] , 

where r x = inf{i > : Xf < 0} is the time of ruin. We are interested in the estimation of 
the ruin probability, in particular for large values of the initial reserve. 
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3.1.2 The Cramer-Lundberg estimate 

The Cramer-Lundberg approximation concerns the estimation of the ruin probability ift(x), 
and is one of the most celebrated result of risk theory. There are several approaches for 
deriving such a result. We follow in this paragraph a method based on large deviations 
arguments and change of probability measures. 

First, we easily see, by the strong law of large numbers, that 

1 Nt 

- y~]Yj -> p a.s., t -» oo, 

f i=i 

where p = AE[Yi] > is interpreted as the average amount of claim per unit of time. The 
safety loading r\ plays a key role in ruin probability. It is defined as the relative amount by 
which the premium rate exceeds p : 

P — P 

T] = <^=^ p = (\ + rf)p. 

P 

Hence Xf/t — ► p — p = pr\ when t goes to infinity. Therefore, if r\ < 0, Xf — ► — oo, and we 
clearly have ip{x) = 1 for all x. For r\ = 0, we can also show that limsupXf = — oo so that 
ip(x) = 1. In the sequel, we make the net profit assumption : 

_ P -XE[ Yl ] 

11 ~ AE[Yi] > U ' [6 - 2) 

which ensures that the probability of ruin is less than 1. 

Since ruin may occur only at the arrival of a claim, i.e. when X jumps downwards, it 
suffices to consider the discrete-time process embedded at the jumps of the Poisson process. 
We then define the discrete-time process Xj, , n > 1, so that 

ip(x) = ¥[<t x <oo], 

where a x = inf{n > 1 : X^ < 0} = inf{n > 1 : S n > x}, and S n = x — Xj, is the net 
payout up to the n-th claim and given by the random walk : 

S n = Z x + . . . + Z n , Zi = Yi- pi h i£N*. 

The r.v. Zi are i.i.d. and satisfy under the net profit condition, ¥\Z\\ < 0. We denote by 
Tz the c.g.f. of the Zi, and we see that by independence of Yi and £j : 

r z («) = iy(S) + r { (-j>«) 

where Ty (resp. T^) is the c.g.f. of the Yi (resp. For any 9 in the domain of Tz, we 
consider the exponential change of measure with parameter 8, and since a x is a stopping 
time in the filtration of {Z\, . . . , Z n ), we apply formula (|2.4p to write the ruin probability 
as an Eg expectation : 

4>{x) = F[a x < oo] = E[l ax<00 ] 



1(t x «x> exp ( - eS ax + a x T z {9)) . (3.3) 
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We now assume Y has a light-tailed distribution, i.e. : there exists 9 £ (0, oo] s.t. T y (9) < 
oo for 9 < 9, and Ty(9) — > oo as 9 /* 9. In this case, the c.g.f. Tz of the Z{ is finite on 
(—X/p,9), it is differentiable in with T' z (0) = E[2Ti] < under the net profit condition 
(|3.2p . Moreover, since E[Ti] > and Yi is independent of £i, we see that P[2Ti > 0] > 0, 
which implies that Tz(9) goes to infinity as 9 goes to 9. By convexity of Tz and recalling 
that r^(0) = 0, we deduce the existence of an unique 9 L > s.t. Tz(0 L ) = 0. This unique 
positive 9 L is the solution to the so-called Cramer- Lundberg equation : 

r >- (9 ' )+1 "(i^r) = °- 

which is also written equivalently in : 

lAh) = ^ ( 3 - 4 ) 

where j Y = exp(ry (9)) — 1 = J e 9y Fy{dy) — 1 is the shifted (yy (0) = 0) moment generating 
function of Yi, and Fy is the distribution function of the claim sizes Y{. 9 L is called 
adjustment coefficient (or sometimes Lundberg exponent). Notice also that by convexity of 
Tz, we have T' Z (9 L ) > 0. Hence, under P# , the random walk has positive drift Eg [Z n ] = 
T' Z (9 L ) > 0, and this implies P# [a x < oo] = 1. For this choice of 9 = 6 h , (|3.3p becomes 



= Eg L e - y L^ = e- y ^Ee L e - e L( s ^-^) . (3.5) 

By noting that the overshoot S ax — x is nonnegative, we obtain the Lundberg's inequality 
on the ruin probability : 

rp{x) < e~ 6 L x , Vx>0. (3.6) 

Moreover, by renewal's theory, the overshoot R x = S ax — x has a limit R°° (in the sense of 
weak convergence with respect to Pe ), when x goes to infinity, and therefore E^ [e~ l ] 
converges to some positive constant C . We then get the classical approximation for large 
values of the initial reserve : 

i/)(x) ~ Ce~ 9 L x , 
as x — > oo, which implies a large deviation type estimation 

lim -lntp(x) = -0 L . (3.7) 

x— ioo aj 

Further details and extensions can be found in [3] or [16]. They concern more general 
processes (e.g. Levy proceses) for the risk reserve, heavy-tailed distribution for the claim 
size ... In the next paragraph, we study an extension of the classical ruin problem, developed 
by [23] and [31], where the insurer has the additional opportunity to invest in a stock market. 



Application : Importance sampling for the ruin probability estimation 

From the perspective of estimation of the ruin probability ip(x), and by choosing the Lund- 
berg exponent 9l, we have an unbiased estimator with the associated importance sampling 
estimator based on Monte-Carlo simulations of 



tj,(x) = Ee L [e 



e L Sa 

L I 
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Since, obviously, S ax > x, the second order moment of this estimator satisfies 

M 2 (6 L ,x) = Eg L [e~ 2dLS ^] < e~ 2dLX . 

On the other hand, by Cauchy-Schwarz's inequality, the second moment of any unbiased 
estimator must be as large as the square of the ruin probability, and we have seen that this 
probability is 0(e~ 9LX ). Therefore, the IS estimator based on 9l is asymptotically optimal 
as x — > oo : 

lim - In M 2 (6 L ,x) = 2 lim -lmp(x) (= 20 L ). 

x— »oc x x^oo x 

Remark 3.1 For any x > 0, 9 > 0, consider the process 

M t {x,9) = exp(-dXf), t>0. 

where Xf is the risk reserve process defined in (I3.1|) . A standard calculation shows that 
for all t > 0, 

E[M t (O,0)] = e ^y(«)-P«)*. 

Moreover, since Xf has stationary independent increments, and by denoting F = {J-t)t>Q 
the filtration generated by the risk reserve X, we have for all < t < T 

K[M T (x,9)\r t ] = E[e- ex T\T t ] 

= M t {x,9)E[e- e( - x T- x ?)\F t } 

= M t (x,9)E[M T -t(0,e)] 

= M t {x,0) e^rW-**)<T-*\ (3. 8 ) 

Hence, for the choice of 9 = 9 L : the adjustment coefficient, the process M t (x, 9 L ), t > 0, is 
a (P, F)-martingale. The use of this martingale property in the derivation of ruin estimate 
was initiated by Gerber [23]. We show in the next paragraph how to extend this idea for a 
ruin problem with investment in a stock market. 

3.2 Ruin probabilities and optimal investment 
3.2.1 The insurance-finance model 

In the setting of the classical model described in the previous paragraph, we consider the 
additional feature that the insurance company is also allowed to invest in some stock market, 
modeled by a geometric brownian motion : 

dS t = bS t dt + aS t dW t , 

where b, a are constants, a > 0, and W is a standard brownian motion, independent of the 
risk reserve X as defined in (|3.ip . We denote by F = (J-"t)t>0 the filtration generated by X 
and S. The insurer may invest at any time t an amount of money at in the stock, and the 
rest in the bond (which in the present model yields no interest). The set A of admissible 
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investment strategies is denned as the set of F-adapted processes a = (at) s.t. f a 2 s ds < 
oo a.s. Given an initial capital x > 0, and an admissible investment control a, the insurer's 
wealth process can then be written as 



+ pt~y2Yi+ / a u (bdu + adW u ), t > 0. 



We define the infinite time ruin probability 

i/;(x,a) = F[T x>a < oo], 

where r xa = inf{i > : V^' a < 0} is the time of ruin, depending on the initial wealth x 
and the investment strategy a. We are interested in the minimal ruin probability of the 
insurer 

ip*{x) = inf ip(x, a). 
3.2.2 Asymptotic ruin probability estimate 

The main result is an asymptotic large deviation estimation for the minimal ruin probability 
when the initial reserve goes to infinity : 



Theorem 3.1 We have 



lim -lni;*(x) = -9*, (3.9) 

x^oo x 

where 6* > is he unique positive solution to the equation 

^ - n + ik- (3 - 10> 

Here J Y (9) = ^[e eYl ] — 1 is the shifted moment generating function of the claim size. 
Moreover, the constant strategy a* = -Jqp; is asymptotically optimal in the sense that 

lim — lnip(x,a*) = —9*. 

x^oo x 

Finally, ifby^0,9* > 9 L the Lundberg exponent. 

Remark 3.2 The estimate (|3.9p is analogue to the classical Lundberg estimate (|3,7p with- 
out investment. The exponent is larger than the Lundberg one, and thus one gets a sharper 
bound on the minimal ruin probability. Moreover, the trading strategy yielding the opti- 
mal asymptotic exponential decay consists in holding a fixed (explicit) amount in the risky 
asset. This surprising result, in apparent contradiction with the common believe that 'rich' 
companies should invest more than 'poor' ones, is explained by the fact that minimization 
of ruin probability is an extremely conservative criterion. 
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We follow the martingale approach of Gerber for stating this theorem. We emphasize 
the main steps of the proof. Let us introduce, for fixed x,9 G R+, and a £ A, the process 

M t (x,9,a) = exp(-0Pf' a ), t>Q. 

Then, a straightforward calculation shows that for any constant process a = a, and t > 0, 

E[M t (O,0,a)] = E[e- e(p *-^=i yi+aW+a,jW/t) ] 

= e -0(p+a6)* E r e eESi ^]E[e" 9a<TW/t ] 

= e -0(p+a6)i e 7 y (e)Ai e ^^t 
= e /(M* 5 

where 

/(0,a) = Xj Y (9)-p9-ab9+^a 2 9 2 a 2 . 

We recall that under the assumption of light-tailed distribution on the claim size 1^, the 
shifted moment generating function 7 y is finite and convex on (—00, 9) for some 9 E (0, 00], 
and 7 y — > 00 as 9 goes to 9. Moreover, recalling that E[Yi] > 0, then 7^ is increasing on 
[0,0) since 7^(0) = E[Yie eyi ] > E[Yi] for < < 0. Now, we see that for all > 0, 

f(9) := mf/(0,a) = A 7y (0)-p0-^, 

with an infimum attained for o(0) = b/(9a 2 ). From the properties of j Y , we clearly have 
the existence and uniqueness of 9* solution to /(0*) = 0, i.e. A3. lQf) . Since the r.h.s. of 
(I3.10j) is just the r.h.s. of (|3l| . but shifted by the positive constant b 2 /2a 2 X (if b / 0), it 
is also obvious that 0* > 9 L . By choosing a* = a(9*) = b 2 /(9*a 2 ), we then have 

1(9*) = f(9*,a*) = 0. 

A straightforward calculation also shows that for all a£l, 

f(9*,a) = ^(0*)V(a-a*) 2 > 0. (3.11) 

Hence, since V x ' a has independent stationary increments, we obtain similarly as in (|3.8p . 
for all < t < T, 

E[M T (x,0*,a*)\F t } = M t (x,9\a*)E[M T -t(0,9*,a*)} 
= M t (x,9,a), 

which shows that the process M(x,9* ,a*) is a (P, E)-martingale. Therefore, from the 
optional sampling theorem at the (bounded) stopping time r x ^ a * A T, we have 

e~ e * x = M (x,9*,a*) = E[M T ^, AT (x, 0*, a*)} 

= E[M Tx ^(x,9*,a*)l rx ^< T ] +M[M T (x,9*,a*)l Tx ^ >T ] 
> E[M Tx ^(x,9*,a*)l Tx ^< T ], 
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since the process M is nonnegative. By the monotone convergence theorem, we then get 
by sending T to infinity into the previous inequality 

e~ e * x > E[M Txa »(x,e*,a*)l Txa * <O0 ] = E[M Tw >a . (a, 9* , a*)\r x>a * < oo)F[T x>a * < oo], 

from Bayes formula. Thus, we get 

e -e*x 

i/)(x,a*) = ¥[r XyQ * < oo] < 



E[M Tx ^(x,9*,a*)\T x , a * < oo]' 

Now, by definition of the time of ruin, is nonpositive and so M T (x, 9* ,a*) > 1 a.s. 

on {r x , a * < oo}. We deduce that 

iP*{x) < ip(x,a*) < e~ 9 * x . (3.12) 

In order to state a lower bound on the minimal ruin probability, we proceed as follows. 
We apply Ito's formula to the process M(x, 9* , a) for arbitrary a £ A : 

^X^Ca) = ( " r iP + hat) + \^ W ^ dt ~ 9 * adW t + ^* YNt ~ 
Observing that J Y (9) = E[e 6YN t — 1], we rewrite as 

^X^Ca) = (- *(P + b»t) + l(9*) 2 a 2 a 2 t +X lY (Ol)dt-9*adW t 

+ (e e * YN t - l)dN t - XE[e e * YN t - l}dt 
= f{9\at)dt-9*adWt + dN t , 

where N t = fQ{e e * YNu — l)dN u — J XE[e e * YNu — l]du. By using the martingale property of 
N t — At, we can check that TV is a martingale. Since /(#*, a t ) > a.s. for all t (see (|3.1ip ). 
we deduce that M(x,9*,a) is a (local) submartingale. To go to a true submartingale, we 
need some additional assumption on the distribution of the claim size. Actually, we can 
prove that under the following uniform exponential tail distribution 

f°° e~ e ' "( z ~y) dF (v) 
supE[e-**( z - yi )|Yi > z] = sup^ j^- — < oo, (3.13) 

z>0 z>0 J z dFy 

the process M(x,9*,a) is an uniformly integrable submartingale. Therefore, from the 
optional sampling theorem at the (bounded) stopping time r XiQ A T, we have 

e - e * x = M (x,9*,a) < E[M r ^ Ar (x, 9\ a)} 

= E[M Tx Jx,9*,a)l Tx!a < T ]+E[M T (x,9*,a)l TXta>T }. (3.14) 

We now claim that Mt(x, 9* , a) converges a.s. on {t X)OA = oo} to zero as T goes to infinity. 
First, we know from Doob's submartingale convergence theorem that \\m.T->oo Mt{x, 9* , a) 
exists a.s., hence also Wmx-^oo Vr(x, ol). Since the expectation of a jump size ^[li] is 
positive, there exists y > s.t. P[Y"i > y] > 0. By independence of the jump sizes in the 
compound Poisson process of the risk reserve, it is then an easy exercice to see that with 
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probability 1, a jump of size greater than y occurs infinitely often on [0, oo). On the other 
hand, the stochastic integral due to the invesment strategy a in the stock price, is a.s. 
continuous, and so cannot compensate the jumps of the compound Poisson process, greater 
than y, which will occur infinitely often a.s. It follows that on {r XiCl = 00} (where ruin does 
not occur), V^' a cannot converge to a finite value with positive probability. Therefore on 
{T x ,a = 00}, we have limr_ >00 Vr(x, a) = 00 and thus, since 6* > 0, limr->oo Mt(x, 9*, a) = 

a.s. As T —* 00, we have then from the dominated convergence theorem [Mt{x, 6* , a) < 

1 on {r x ,a > T}) for the second term in (|3.14j) . and by the monotone convergence theorem 
for the first term, 

e~ rx < E[A4 Tx Jx,e*,a)l Tx , a<OQ ] = E[M T(M ,(a;, 0*, a)\r x>a < oo]¥[t Xj(X < oo], 
and so 

- Hr„<=o] > m -^L f —^ . (3.15) 

We finally prove that E[M Tx a (x, 6*, a)\T x>a < 00] is bounded by a constant independent of 
a £ A. Fix some arbitrary en £E A and set for shorthand notation t — T x a the time of ruin 
of the wealth process V x ' a . First observe that ruin {t < 00} occurs either through the 
brownian motion, i.e. on {r < 00, V^l a = 0}, and in this case Vr' a = and so M T (x, 0* , a) 
= 1, or through a jump, i.e. on {t < 00, V x l a > 0}, and in this case V x ' a < and so 
M T (x,6*,a) > 1. Hence, 

E[M T (x,6*,a)\T < 00] < E[M T (x,e*,a)\r < oo,V*l a > 0] 

= E[e~ e * (V ^ Q - y ^)| r < oo,t£i a > 0]. (3.16) 

Let H x ' a (dt,dz) denote the joint distribution of r and V l a conditional on the event {r < 
00, V*l a > 0} that ruin occurs through a jump. Given r = t and yf_l Q = z > 0, a claim 
Y/v T occurs at time t and has distribution dF Y (y)/ dF Y for y > z. Hence 

ne -6*(V^-Y NT ) ]r<oo ^ x , a>0] = r r H *,a {dt dz) r e -9*{ Z -y)^yM. 

Jo Jo Jz J z dF Y 

< sup /"e^M^M < 00, (3.17) 



z>0 



by assumption (|3.13p . By setting 

a - - inf ^ ^ y (= en 11 

sun f°°p-^MjfVM. *>o f°° e-0'(*-v)dF Y (y) 1 ' J ' 

J z Y 

we then have from (|3TT5]) - (|3"3B|) - (j3TTT]) . for all s > 0, 

Together with the upper bound (|3.12|) . this completes the proof of Theorem 13. 1[ 
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4 Large deviations and rare event simulation in option pri- 
cing 



4.1 Importance sampling and large deviations approximations 

In this paragraph, we show how to use large deviations approximation via importance 
sampling for Monte-carlo computation of expectations arising in option pricing. In the 
context of continuous-time models, we are interested in the computation of 



where S is the underlying asset price, and g is the payoff of the option, eventually path- 
dependent, i.e. depending on the path process S t , < t < T. The Monte-Carlo approxima- 
tion technique consists in simulating N independent sample paths (£?)o<t<T, i = 1, ■ ■ ■ , N, 
in the distribution of (St)o<t<T, an d approximating the required expectation by the sample 
mean estimator : 



The consistency of this estimator is ensured by the law of large numbers, while the error 
approximation is given by the variance of this estimator from the central limit theorem : 
the lower is the variance of g(S), the better is the approximation for a given number N of 
simulations. As already mentioned in the introduction, the basic principle of importance 
sampling is to reduce variance by changing probability measure from which paths are gen- 
erated. Here, the idea is to change the distribution of the price process to be simulated 
in order to take into account the specificities of the payoff function g. We focus in this 
section in the importance sampling technique within the context of diffusion models, and 
then show how to obtain an optimal change of measure by a large deviation approximation 
of the required expectation. 

4.1.1 Importance sampling for diffusions via Girsanov's theorem 

We briefly describe the importance sampling variance reduction technique for diffusions. 
Let X be a d-dimensional diffusion process governed by 



where (Wt)t>o is a n-dimensional brownian motion on a filtered probability space (O, J 7 , F = 
(•^t)t>0) F), and the Borel functions b, £ satisfy the usual Lipschitz condition ensuring the 
existence of a strong solution to the s.d.e. (|4.ip . We denote by Xt' x the solution to (|4.ip 
starting fom x at time t, and we define the function : 



E g(S t ,0< t < T) 




dX s 



b{X s )ds + E(X s )dW s 




v(t, x) 



E g(Xl' x ,t < s < T) , (t, x) G [0, T] x M. d . 



Let <j) = (4>t)o<t<T be an Revalued adapted process such that the process 




< t < T, 
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is a martingale, i.e. E[Mp] = 1. This is ensured for instance under the Novikov criterion : 
E[exp (i Jq \(f> u \ 2 du)] < oo. In this case, one can define a probability measure Q equivalent 
to P on (n,^r) by : 

dQ 

Moreover, by Girsanov's theorem, the process W t = W t + £(f> u du,0<t<T, is a brownian 
motion under Q, and the dynamics of X under Q is given by 

dX s = (b(X s )-Y,{X s )<j> s )ds + Y,(X s )dW s . (4.2) 

From Bayes formula, the expectation of interest can be written as 

v(t, x) = E Q \g(Xl' x ,t < s < T)L T ] , (4.3) 

where L is the Q-martingale 

Lt = 4f = exp ( f ^dWu -\ f \<Pu?du) , < t < T. (4.4) 



The expression (|4.3p suggests, for any choice of (ft, an alternative Monte-Carlo estimator for 
v(t,x) with 

1 - 

by simulating N independent sample paths (X l ' t,x ) and U T of (X t,x ) and Lt under Q given 
by (|4.2p - (|4.4p . Hence, the change of probability measure through the choice of 4> leads to a 
modification of the drift process in the simulation of X. The variance reduction technique 
consists in determining a process (f>, which induces a smaller variance for the corresponding 
estimator I g ^ than the initial one I 9) o- The two next paragraphs present two approaches 
leading to the construction of such processes (f>. In the first approach developed in [22], the 
process 4> is stochastic, and requires an approximation of the expectation of interest. In the 
second approach due to [25], the process <t> is deterministic and derived through a simple 
optimization problem. Both approaches rely on asymptotic results from the theory of large 
deviations. 



4.1.2 Option pricing approximation with a Freidlin-Wentzell large deviation 
principle 

We are looking for a stochastic process (j>, which allows to reduce (possibly to zero!) the 
variance of the corresponding estimator. The heuristics for achieving this goal is based 
on the following argument. Suppose for the moment that the payoff g depends only on 
the terminal value Xt- Then, by applying Ito's formula to the Q-martingale v(s, X t s ' x )L s 
between s = t and s = T, we obtain : 

g(x¥)Lr = v(t, x)L t + £ L s (D x v(s, X^)'S(X^) + v(x, Xl^dW,. 
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Hence, the variance of I^,(t,x) is given by 



The choice of the process 4> is motivated by the following remark. If the function v were 
known, then one could vanish the variance by choosing 

' H'(XZ*)D x v(s,X?>), t<s<T. (4.5) 



v(s,X s 



Of course, the function v is unknown (this is precisely what we want to compute), but this 
suggests to use a process 4> from the above formula with an approximation of the function 
v. We may then reasonably hope to reduce the variance, and also to use such a method 
for more general payoff functions, possibly path-dependent. We shall use a large deviations 
approximation for the function v. 

The basic idea for the use of large deviations approximation to the expectation function 
v is the following. Suppose the option of interest, characterized by its payoff function g, 
has a low probability of exercice, e.g. it is deeply out the money. Then, a large proportion 
of simulated paths end up out of the exercice domain, giving no contribution to the Monte- 
carlo estimator but increasing the variance. In order to reduce the variance, it is interesting 
to change of drift in the simulation of price process to make the domain exercice more 
likely. This is achieved with a large deviations approximation of the process of interest 
in the asymptotics of small diffusion term : such a result is known in the literature as 
Freidlin-Wentzell sample path large deviations principle. Equivalently, by time-scaling, 
this amounts to large deviation approximation of the process in small time, studied by 
Varadhan. 

To illustrate our purpose, let us consider the case of an up-in bond, i.e. an option that 
pays one unit of numeraire iff the underlying asset reached a given up-barrier K. Within a 
stochastic volatility model X = (5, Y) as in (14. ip and given by : 

dS t = o{Y t )S t dWt (4.6) 
dY t = I7 (y t )di + 7 (y t )dW? > d<W x ,W 2 >t= pdt, (4.7) 

its price is then given by 

v(t,x) = ni maXt ^ T s^> K ] = P[r t , a < T), t€ [0,T], x = (s,y) G (0, oo) x R, 

where 

r tim = M{u>t iXftT}, T = (0,K)xR. 

The event {maxt< u <T Su X > K} = {r t < T} is rare when x = (s,y) G T, i.e. s < K 
(out the money option) and the time to maturity T — t is small. The large deviations 
asymptotics for the exit probability v(t,x) in small time to maturity T — t is provided by 
the Freidlin-Wentzell and Varadhan theories. Indeed, we see from the time-homogeneity of 
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the coefficients of the diffusion and by time-scaling that we may write v(t,x) = w T _ t (0,x), 
where for e > 0, w e is the function defined on [0, 1] x (0, oo) x M by 

w e (t,x) = P[< K <1], 

and X £,t,x is the solution to 



dX £ s = eb(X £ s )ds + VeZ(X £ s )dW s , X, 



x. 



and r e = inf{s > t : Xt ,t,x ^ T}. From the large deviations result (|2.19p stated in 
paragraph 12.31 we have : 



lim -(T -t)lnv(t,x) = V (0,x), 

t/T 



where 



f 1 1 

V (t,x) = inf / -x(u)'M(x(u))x(u)du, (t, x) G [0, 1) x T, 
x(.)eA(t,x)J t 2 

= [ a ^ | is the diffusion matrix of X = (S,Y), Mix) = (SS'(x))- 1 , and 
V 7W J 

A(t,x) = {x(.) G F/ oc ([0, 1], (0,oo) x E) : x(t) = s and r(x) < 1}. 

Here, for an absolutely continuous function on [0,1) and valued in (0, oo) x M, we 
denote x(u) its time derivative, and t(x) the exit time of x(.) from T. 

We also have another interpretation of the positive function Vq in terms of Riema- 
nian distance on W 1 associated to the metric M(x) = (SS'(x)) -1 . By denoting L$(x) = 
-y/2Vo(0, x), one can prove (see [3l]) that Lq is the unique viscosity solution to the eikonal 
equation 

(D x L o ytt(x)D x L = 1, xeT 
L (x) = 0, xedT 

and that it may be represented as 



L (x) = ML (x,z), xeT, (4.8) 



-i 

Lq(x,z) = inf / \J x(u)'M(x(u))x(u)du, 



where 



x(.)eA(x,z) Jo 

and A(x,z) = {x(.) G H} oc ([0, 1], (0, oo) x R) : x(0) = x and x(l) = z}. Hence, the 
function Lo can be computed either by the numerical resolution of the eikonal equation or 
by using the representation (|4.8j) . Lq(x) is interpreted as the minimal length (according to 
the metric M) of the path x(.) allowing to reach the boundary dT from x. From the above 
large deviations result, which is written as 



lnv(t,x) ~ ^^r, as T-t^0, 

v ' ; 2(T -t) 
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and the expression (|4.5p for the optimal theoretical <fi*, we use a change of probability 
measure with 

<f>(t,x) = p^X'(x)D x L (x). 

Such a process 4> may also appear interesting to use in a more general framework than 
up-in bond : one can use it for computing any option whose exercice domain looks similar 
to the up and in bond. We also expect that the variance reduction is more significant as 
the exercice probability is low, i.e. for deep out the money options. In the particular case 
of the Black-Scholes model, i.e. cr(x) as, we have 



and so 



1 s 

<i>(t,x) = —— rln( — ), s<K. 



a(T-t) X K 

4.1.3 Change of drift via Varadhan-Laplace principle 

We describe here a method due to [25] . which, in contrast with the above approach, does not 
require the knowledge of the option price. This method restricts to deterministic changes 
of drift over discrete time steps. Hence, the diffusion model X for state variables (stock 
price, volatility) is simulated (eventually using an Euler scheme if needed) on a discrete 
time grid = to < t± < . . . < t m = T : the increment of the brownian motion from to 
ti is simulated as ^Jti — ti-iZi, where Z\, . . . , Z m are i.i.d. n-dimensional standard normal 
random vectors. We denote by Z the concatenation of the Z^ into a single vector of lengh I 
= ran. Each outcome of Z determines a path of state variables. Let G(Z) denote the payoff 
derived from Z, and our aim is to compute the (path-dependent) option price E[G(Z)]. For 
example, in the case of the Black-Scholes model for the stock price S, we have 

2 

Su = exp ( - ?—(ti - ti-i) + a^Jti - U-iZij , 

and the payoff of the Asian option is 

1 m 

G(Z) = G(Z u ...,Z m ) = f-J2s ti -K) 

i=l 

We apply importance sampling by changing the mean of Z from to some vector /i = 
(Hi, . . . , Hm). We denote P M and E M the probability and expectation when Z ~ I m ). 
Notice that with the notations of paragraph l4.1.1l this corresponds to a piecewise constant 
process <f> s.t. <pt = —fc/y/ti — tj^l ° n \Pi-li U)- By Girsanov's theorem or here more simply 
from the likelihood ratio for normal random vectors, the corresponding unbiased estimator 
is then obtained by taking the average of independent replications of 

tf M = G(Z)e~v' z+ ^ ,fl , 
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where Z is sampled from I m ). We call a //-IS estimator. In order to minimize over 
[i the variance of this estimator, it suffices to minimize its second moment, which is given 
by: 



M 2 (/i) 



E, 



G(Z) 2 e 



-2/j,' Z+fj,' n 



E 



We are then looking for an optimal /x solution to 



inf M 2 Gu) 



infE 



G(zy e 



2 - M 'Z+±// M 



(4.9) 



This minimization problem is, in general, a well-posed problem. Indeed, it is shown in [2] 
that if F[G(Z) > 0] > 0, and E[G(Z) 2+S ] < oo for some 8 > 0, then M 2 (.) is a strictly convex 
function, and thus \i* solution to (|4.9p exists and is unique. This fj,* can be computed by 
solving (numerically) VM 2 (/i) = 0. This is the method adopted in [2] with a Robbins- 
Monro stochastic algorithm. We present here an approximate resolution of (|4.9p by means 
of large deviations approximation. For this, assume that G takes only nonnegative values, 
so that it is written as G{z) = exp(F(z)), with the convention that F(z) = —oo if G(z) = 
0, and let us consider the more general estimation problem where Z is replaced by Z e = 
\J~eZ and we simultaneously scale the payoff by raising it to the power of 1/e : 

v £ = n^ F{Z£) \ 

The quantity of interest E[G(Z)] = E[e F(z) ] is v £ for e = 1. We embed the problem of 
estimating v\ in the more general problem of estimating v e and analyze the second moment 
of corresponding IS estimators as e is small, by means of Varadhan-Laplace principle. For 
any fj,, we consider /i e -IS estimator of v £ with // = ^i/y/e : 



if 



e \F(^z) e -^z+y ef i e 



where Z is sampled from Af(/j, £ ,I m ) = Af(/J,/y/e, I m ). Its second moment is 



M e V) 



E 



it. 



l(2F(Z E )-2 M '2 e + At ' At ) 



E 



Now, from Cramer's theorem, {Z £ ) £ satisfies a LDP with rate function I{z) = \z' z. Hence, 
under the condition that F(z) < c\ + ciz ' z for some C2 < 1/4, one can apply Varadhan's 
integral principle (see Theorem I2.2[) to the function z — > 2F(z) — y! z + an< i § e t 



limelnM^) 



sup [2F(z) 



-z z 



This suggests to search for a \i solution to the problem : 



inf sup [2F(z) 



1 



1 



fiz+-nn--zz\ 



(4.10) 



(4.11) 



This min-max problem may be reduced to a simpler one. Indeed, assuming that the con- 
ditions of the min/max theorem hold, then the inf and sup can be permuted, and we 
find 



inf sup [2F{z) — fj,'z H — ///i 

A* 2 2 



1 



sup 



inf ( — fjfz + -////) + 2F(z) - -z'^ 
A( 2 2 



2sup[F(z) 



2 J 



(4.12) 
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Actually, under suitable convexity conditions on F and its domain, one can show (see |25j) 
that (|4.12p holds. Furthermore, if z is solution to 

sup (F(z)-lz'z), (4.13) 

2 2 

then a solution jl solution to (|4.1ip should be identified with the conjugate of z, via 
inf^ ( — li'z + j/^V) that is fx = z. The solution to problem (|4.13j) has also the following 
interpretation. From heuristic arguments of importance sampling (see the introduction), 
an optimal effective importance sampling density should assign high probability to regions 
on which the product of the integrand payoff and the original density is large. For our 
problem, this product is proportional to 

since exp(— z'z/2) is proportional to the standard normal density. This suggests to choose 
fx = jx solution to (|4.13|) . Another heuristics indication for the choice of (|4.13p is based on 
the following argument. Assume that F is C 1 on its domain, and the maximum jl in (|4.13p 
is attained in the interior of the domain, so that it solves the fixed point equation : 

VF(fi) = jx. (4.14) 

By using a first-order Taylor approximation of F around the mean jx of Z under P^, we 
may approximate the estimator as 

= e F(Z)-{i'Z+\fi'fi _ e F{[i)+VF{[i)'{Z~[i)-i:i l Z+\[i'ii ( 415 ) 

Hence, for the choice of jl satisfying (|4.14p . the expression of the r.h.s. of (|4.15p collapses 
to a constant with no dependence on Z. Thus, applying importance sampling with such a 
jl would produce a zero-variance estimator if (|4.15p holds exactly, e.g. if F is linear, and it 
should produce a low-variance estimator if (|4.15p holds only approximately. 

The choice of fx = jl solution to (|4.13|) leads also to an asymptotically optimal IS- 
estimator in the following sense. First, notice that for any fx, we have by Cauchy-Schwarz's 
inequality : M^(fx) > (v e ) 2 , and so 

lime In M|(/i) > 21imelnf £ = 2 limelnE[e^ F(ze) ] 

From Varadhan's integral principle applied to the function z — > F(z), we thus deduce for 
any fi, 

limelnMlGu) > 2 sup[F(z) - \z' z] = 2[F(£) - h'jl}. 

e^O z 2 2 

Hence, 2[F(jl) — \jl'jl\ is the best-possible exponential decay rate for a jU e TS estimator $ e . 
Now, by choosing fx = jl, and from (|4,10p . (|4.12p . we have 

hm elnM £ 2 (A) = 2[F([l) -\jl' jl], 
which shows that the /i e -IS estimator is asymptotically optimal. 
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Remark 4.1 From the first-order approximation of F in (|4.15p . we see that in order to 
obtain further variance reduction, it is natural to address the quadratic component of F. 
This can be achieved by a method of stratified sampling as developed in [25J. 

Recently, the above approach of [25] was extended in [29J to a continuous-time setting, 
where the optimal deterministic drift in the Black-Scholes model is characterized as the 
solution to a classical one-dimensional variational problem. 

4.2 Computation of barrier crossing probabilities and sharp large devia- 
tions 

In this paragraph, we present a simulation procedure for computing the probability that a 
diffusion process crosses pre-specified barriers in a given time interval [0,T]. Let (-Xt)t£[o,Tl 
be a diffusion process in M. d , 

dX t = b{X t )dt + a{X t )dW t 

and r is the exit time of X from some domain T of eventually depending on time : 

r = inf{tG [0,T] :X t $r(t)}, 

with the usual convention that inf = do. Such a quantity appears typically in finance in 
the computation of barrier options, for example with a knock-out option : 

C = E[e- rT g(X T )l T>T ], (4.16) 

with T(t) = (— oo, U(t)) in the case of single barrier options, and T(t) = (L(t),U(t)), for 
double barrier options. Here, L, U are real functions : [0, oo) — ► (0, oo) s.t. L < U. 

The direct naive approach would consist first of simulating the process X on [0, T] 
through a discrete Euler scheme of step size e = T/n = ij+i — ij, i = 0, . . . , n : 

X! i+1 = Xl + b(Xl)e + a(Xl)(W ti+1 -W u ), 

and the exit time r is approximated by the first time the discretized process reaches the 
barrier : 

f £ = inf{t, -.XliTiti)}. 

Then, the barrier option price Co in (|4.16|) is approximated by Monte-Carlo simulations of 
the quantity 

C £ = E[e- rT g(X^)l f . >T ]. 

In this procedure, one considers that the price diffusion is killed if there exists a value Xf., 
which is out of the domain r(tj). Hence, such an approach is suboptimal since it does 
not control the diffusion path between two successive dates t{ and tj+i : the diffusion path 
could have crossed the barriers and come back to the domain without being detected. In 
this case, one over-estimates the exit time probability of {t > T}. This suboptimality is 
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confirmed by the property that the error between Co and Cq is of order y/e, as shown in 
[28], instead of the usual order e obtained for standard vanilla options. 

In order to improve the above procedure, we need to determine the probability that 
the process X crosses the barrier between discrete simulation times. We then consider the 
continuous Euler scheme 

Xf = Xl+b(Xl)(t-t t ) + a(Xl)(W t -W u ), U<t<t i+1 , 

which evolves as a Brownian with drift between two time discretizations ij, ij+i = ij + e. 
Given a simulation path of (Xf.)i, and values Xf. = x^, = Xj+i, we denote 

P Uxi,x i+1 ) = F[3te [U,t i+1 ] :X% tT(t i )\(Xf i ,Xf i+1 ) = (x i ,x i+1 ] 

the exit probability of the Euler scheme conditionally on the simulated path values. The cor- 
rection Monte-Carlo procedure works then as follows : with probability p\ = pf(Xf. , Xf ), 
we stop the simulation by considering that the diffusion is killed, and we set r £ = if, with 
probability 1— p\ , we carry on the simulation. The approximation of (|4.16p is thus computed 
by Monte-Carlo simulations of 

C £ = E[e- rT (X £ T -K)+l Te>T ]. 

We then recover a rate of convergence of order e for Cq — Co, see [23]. 

The effective implementation of this corrected procedure requires the calculation of p\. 
Notice that on the interval [tj,ij+i], the diffusion X s conditionned to Xf. = Xi, Xf j+1 = 
Xj_|_i, is a brownian bridge : it coincides in distribution with the process 

Bl' E = x i + -(x i+1 -x i ) + a(x i ){W t --W £ ), 0<t<e, 

and so by time change t — > t/e, with the process 

Y i,s ._ gig = x . + t ( x . +l _ x j + ^ (7 ( x .> ) ( Wt _ tWi ^ o < i < 1. 

It is known that the process Y' l,e is solution to the s.d.e. 

dY i,e = t_i±l dt + ^ a ( x .} dWu o < t < 1, 

1q X{. 

The probability p\ can then be expressed as 

pt{xi,x i+ i) = P[T i,e <l], where = inf{t>0 : Y?' e £ T(U + et)}. (4.17) 

In the case of a half-space, i.e. single constant barrier, one has an explicit expression for 
the exit probability of a Brownian bridge. For example, if T(t) = (— oo, U), we have 

pf(xi,x i+1 ) = exp f- IjJ Xl " Xl+l \ w ith ^(xi^Xi+i) = ^ (U - Xi)(U - x i+1 ). 

V £ / (J [X'l J 
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In the general case, we do not have analytical expressions for p|, and one has to rely on 
simulation techniques or asymptotic approximations. We shall here consider asymptotic 
techniques based on large deviations and Freidlin-Wentzell theory. Let us illustrate this 
point in the case of two time-dependent barriers, i.e. T(t) = (L(t), U(t)) for smooth barriers 
functions L < U. Problem (|4.17|) does not exactly fit into the Freidlin-Wentzell framework 
considered in paragraph 12.31 but was adapted for Brownian bridges with time-dependent 
barriers in [7J. We then have the large deviation estimate for p| : 



lunelnpl(xi,x i+ i) = -I LU (xi,x i+ i), 



where I L v (xi, Xi+x) is the infimum of the functional 

over all absolutely continuous paths y(.) on [0, 1] s.t. y(0) = x^ and there exists some t E 
[0, 1] for which y(t) < L(ti) or y(t) > U(U). This infimum is a classical problem of calculus 
of variations, which is explicitly solved and gives for any Xi,Xi + i G (L(ti), U(ti)) (otherwise 
I (xi,x i+1 ) = 0) : 



^f^(U(ti) - Xi )(U(ti) -x i+1 ) if Xi + x l+1 > L(U) + U(ti) 
(xi - L(ti))(x i+1 - L(ti)) if Xi + x i+1 < L(ti) + U (U). 



In order to remove the log estimate on p|, we need a sharper large deviation estimate, and 
this is analyzed by the results of [T7] recalled in paragraph 12.31 More precisely, we have 

p £ i(xi,x i+ i) = exp ( h ' u ^ 1+1 -w LU (xi,x i+1 )j(l + 0(£)), 

where w L v (xi,Xi + i) is explicited in [7] as 

f ^n(U{ti)-Xi)U'(ti) if Xi + x i+1 > L(ti) + U(ti) 
L ' uK U l+ ) { ^(xi-L(t t ))L'(ti) if Xi + x w <L(ti) + U(ti). 

Some recent extensions of this large deviations approach to the computation of exit prob- 
abilities for multivariate Brownian bridge are studied in [32], which also gives applications 
for the estimation of default probabilities in credit risk models, and the pricing of credit 
default swaps. 



5 Large deviations in risk management 
5.1 Large portfolio losses in credit risk 

5.1.1 Portfolio credit risk in a single factor normal copula model 

A basic problem in measuring portfolio credit risk is determining the distribution of losses 
from default over a fixed horizon. Credit portfolios are often large, including exposure to 
thousands of obligors, and the default probabilities of high-quality credits are extremely 
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small. These features in credit risk context lead to consider rare but significant large loss 
events, and emphasis is put on the small probabilities of large losses, as these are relevant 
for calculation of value at risk and related risk measures. 
We use the following notation : 

n = number of obligors to which portfolio is exposed, 

Yfc = default indicator (= 1 if default, otherwise) for k-th obligor, 

Pk = marginal probability that k-th obligor defaults, i.e. pk = = 1]j 

Cfc = loss resulting from default of the fc-th obligor, 

L n = c{Yi + . . . + c n Y n = total loss from defaults. 

We are interested in estimating tail probabilities ¥[L n > £ n ] in the limiting regime at 
increasingly high loss thresholds £ n , and rarity of large losses resulting from a large number 
n of obligors and multiple defaults. 

For simplicity, we consider a homogeneous portfolio where all pk are equal to p, and 
all Cfc are equal constant to 1. An essential feature for credit risk management is the 
mechanism used to model the dependence across sources of credit risk. The dependence 
among obligors is modelled by the dependence among the default indicators Y/~. This 
dependence is introduced through a normal copula model as follows : each default indicator 
is represented as 

Y k = \x k >x k }, k = l,...,n, 

where {X\, . . . , X n ) is a multivariate normal vector. Without loss of generality, we take each 
Xk to have a standard normal distribution, and we choose Xk to match the marginal default 
probability pk, i.e. Xk = <3? -1 (l — Pk) = ~^~ 1 {Pk)i with <I> cumulative normal distribution. 
We also denote if = the density of the normal distribution. The correlations along the 
Xk, which determine the dependence among the Yk, are specified through a single factor 
model of the form : 

X k = pZ + ^l-p^ek, k = l,...,n. (5.1) 

where Z has the standard normal distribution M(0, 1), £k are independent jV(0, 1) distribu- 
tion, and Z is independent of £k, k = 1, . . . , n. Z is called systematic risk factor (industry, 
regional risk factors for example ...), and is an idiosyncratic risk associated with the A;-th 
obligor. The constant p in [0, 1) is a factor loading on the single factor Z, and assumed 
here to be identical for all obligors. We shall distinguish the case of independent obligors 
(p = 0), and dependent obligors (p > 0). More general multivariate factor models with 
inhomogeneous obligors are studied in |26j . 

5.1.2 Independent obligors 

In this case, p = 0, the default indicators Yk are i.i.d. with Bernoulli distribution of 
parameter p, and L n is a binomial distribution of parameters n and p. By the law of large 
numbers, L n /n converges to p. Hence, in order that the loss event {L n > l n } becomes rare 
(without being trivially impossible), we let l n /n approach q G {p, 1). It is then appropriate 
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to specify l n = nq with p < q < 1. From Cramer's theorem and the expressions of the 
c.g.f. of the Bernoulli distribution and its Fenchel-Legendre transform, we obtain the large 
deviation result for the loss probability : 

lim-lnP[L n > nq] = -qln (-) - (1 - q) ln (— — ) < 0. 

n— » n p 1 — p 

Remark 5.1 By denoting T(9) = ln(l — p + pe e ) the c.g.f. of Y k , we have an IS (unbiased) 
estimator of W[L n > nq] by taking the average of independent replications of 

exp(-6>L n + nT(0))l Ln > nq 

where L n is sampled with a default probability p(9) = P^Yfc = 1] = pe 9 /(l — p + pe d ). 
Moreover, see Remark 12. 2\ this estimator is asymptotically optimal, as n goes to infinity, 
for the choice of parameter 6 q > attaining the argmax in 9q — T(9). 

5.1.3 Dependent obligors 

We consider the case where p > 0. Then, conditionally on the factor Z, the default indicators 
Yj, are i.i.d. with Bernoulli distribution of parameter : 



p(Z) = P[Y k = 1\Z] = F[ P Z p*e k > -§-\p)\Z] 

- *f^^V (5-2) 



Hence, by the law of large numbers, L n /n converges in law to the random variable p(Z) 
valued in (0, 1). In order that {L n > l n } becomes a rare event (without being impossible) 
as n increases, we therefore let l n /n approach 1 from below. We then set 

l n = nq n , with q n < 1, q n /* 1 as n — > oo. (5-3) 

Actually, we assume that the rate of increase of q n to 1 is of order n~ a with a < 1 : 

1 - Qn = 0(n- a ), with < a < 1. (5.4) 

We then state the large deviations result for the large loss threshold regime. 

Theorem 5.1 In the single-factor homogeneous portfolio credit risk model (15. and with 
large threshold l n as in (|5.3p - (l5.4p . we have 

1 1-p 2 
Jim — — lnP[L n > nq n ] = -a ^ — . 



n— »oo 



In n p z 



Observe that in the above theorem, we normalize by Inn, indicating that the probability 
decays like n~ 7 , with 7 = a(l — p 2 )/p 2 . We find that the decay rate is determined by the 
effect of the dependence structure in the Gaussian copula model. When p is small (weak 
dependence between sources of credit risk), large losses occur very rarely, which is formalized 
by a high decay rate. In the opposite case, this decay rate is small when p tends to one, 
which means that large losses are most likely to result from systematic risk factors. 
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Proof. 1) We first prove the lower bound : 

1 1 - p 2 

liminf- lnP[L n > nq n ] > —a ^ — . (5-5) 

n— >oo Inn p z 

From Bayes formula, we have 

F[L n >nq n ] > F[L n > nq n ,p(Z) = q n ] 

= F[L n >nq n \p(Z)>q n ]F[p(Z)>q n ]. (5.6) 

For any n > 1, we define z n S R the solution to 

p{z n ) = q n , n> 1. 

Since p{.) is an increasing one to one function, we have {p(Z) > q n } = {Z > z n }. Moreover, 
observing that L n is an increasing function of Z, we get 

> nq n \p{Z) > q n ] = F[L n >nq n \Z> z n ] 

> F[L n > nq n \Z = z n ] = F[L n > nq n \p{Z) = q n ], 

so that from (15,61) 



>nq n ) > F[L n > nq n \p(Z) = q n ]F[Z > z n }. (5.7) 
Now given p(Z) = q n , L n is binomially distributed with parameters n and q n , and thus 

F[L n > nq n \p(Z) = q n ] > 1 - $(0) = ^(> 0). (5.8) 

We focus on the tail probability F[Z > z n ] as n goes to infinity. First, observe that since 
q n goes to 1, we have z n going to infinity as n tends to infinity. Furthermore, from the 
expression (15.2D of p(z), the rate of decrease (15. 4ft . and using the property that 1 — $(x) ~ 
(p(x)/x as x — * oo, we have 



0(n- a ) = l-q n = l-p(z n ) = 1-$ 



V^J 2 



exp - -( 



pz n + <5>- l {p) ^\ 2 V y/T^ 
as n — ► oo, so that by taking logarithm : 



This implies 



By writing 



1 p 2 z 2 

alnn--- ^-ln,z n = 0(1). 

2 1 — p z 



z 2 1 - o 2 

lim = 2a— (5.9) 

n.— »oc In n p 2 



F[Z>z n ] = F[z n <Z<z n + l] 

> 4=ex P (-^ + l) 2 ) ) 
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we deduce with (15.9 



1 l-p 2 

liminf- \n¥[Z > z n ] > a f-. 

n— >oo Inn p l 

Combining with (|5.7p and (|5.8p . we get the required lower bound (|5.5 

2) We now focus on the upper bound 

1 l-p 2 

— lnP[L n > nq n ] < -a ; 

inn p 

We introduce the conditional c.g.f. of : 



limsup - — ; - lnP[X n > nq n ] < —a — ^ — . (5.10) 



T(9,z) = ]nE[e eY "\Z = z] (5.11) 
= ln(l -p(z) +p(z)e 9 ). (5.12) 

Then, for any 6 > 0, we get by Chebichev's inequality, 

F[L n >nq n \Z] < E[e 0(L " _n9n) |Z] = e -<0q n -T{e,z)) ^ 

so that 

F[L n >nq n \Z] < e~ nT *^ z \ (5.13) 

where 

T(q,z) = S Meq-mz)] = [ qln{ _ k) + {1 _ q)ln{ _l^_ l if pW<g <l. 

By taking expectation on both sides on (|5.13|) . we get 

nL n >nq n ] < E[e F "W], (5.14) 

where we set F n (z) = —nT*(q n ,z). Since p > 0, the function p(z) is increasing in z, so 
r(0, z) is an increasing function of z for all 9 > 0. Hence, F n (z) is an increasing function of 
z, which is nonpositive and attains its maximum value 0, for all z s.t. q n = p{z n ) < p(z), 
i.e. z > z n . Moreover, by differentiation, we can check that F n is a concave function of z. 
We now introduce a change of measure. The idea is to shift the factor mean to reduce the 
variance of the term e Fn ^ in the r.h.s. of (|5.14j) . We consider the change of measure 
that puts the distribution of Z to M(p, 1). Its likelihood ratio is given by 

— = exp (pZ --p), 

so that 

E[e F »( z )] = E fl [e F ^ z ^ z+ ^ 2 }, 

where denotes the expectation under P M . By concavity of F n , we have F n (Z) < F n (p) + 
F' n {p)(Z — p), so that 

E[e F »(z)] < E ^ e F«M«KM-riz-i*nV)+fr 3 ]. (5.15) 
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We now choose \i = p n solution to 

KiP-n) = Pn, (5.16) 

so that the term in the expectation in the r.h.s. of (|5.15|) does not depend on Z, and is 
therefore a constant term (with zero- variance) . Such a p n exists, since, by strict concavity 
of the function z — > F n (z) — ^z 2 , equation (|5.16p is the first-order equation associated to 
the optimization problem : 

p n = argmax[F n (//) - \p 2 ]. 

With this choice of factor mean p n , and by inequalities (|5.14p . (|5.15p . we get 

F[L n >nq n ] < e Fn ^ n) ~^ . (5.17) 

We now prove that p n j 'z n converges to 1 as n goes to infinity. Actually, we show that for 
all e > 0, there is uq large enough so that for all n > no, z n (l — e) < p, n < z n . Since 
F' n {p n ) — p n = 0, and the function F^(z) — z is decreasing by concavity F n (z) — z 2 /2, it 
suffices to show that 

F' n {z n {l - e)) - z n {\ - e) > and F' n (z n ) - z n < 0. (5.18) 

We have 

F ni z ) = n \ -73 1 -7-rJ ^ 



The second inequality in f|5. 18|) holds since F' n (z n ) = and z n > for g n > p, hence for 
n large enough. Actually, z n goes to infinity as n goes to infinity from (|5.9p . For the first 
inequality in (|5.18p . we use the property that 1 — ~ ip(x)/x as x — > oo, so that 

lim ^- — — - = 1, and lim ^— — — = 0. 

>p(z n (l-£)) n-+oo 1 -p(z n (l -£)) 



n— +oo 



From (j5.9fl . we have 



- e )+*-l(ph = 0(n -a(l- £)2)) 
v a/1 - p 2 ' 

and therefore 

F' n {z n (l-e)) = Oin 1 -^ 2 ). 
Moreover, from (|5.9p and as a < 1, we have 

z n (l-e) = 0(Vh^n) = o(n 1 - a(1 - £)2 ) 
We deduce that for n large enough F' n {z n (\ — e)) — z n (l — e) > and so (|5.18p . 
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Finally, recalling that F n is nonpositive, and from (|5.17p . we obtain : 

1 1 u 2 1 z 2 1 — o 2 

limsup \n¥[L n >nq n ] < — limf-^- = — lim — ^ = -a f-. (5.19) 

n->oo Inn 2 n-» Inn 2 ™^ Inn ,cr 

□ 

Application : asymptotic optimality of two-step importance sampling estimator 

Consider the estimation problem of P[L n > nq\. We apply a two-step importance sampling 
(IS) by using IS conditional on the common factors Z and IS to the distribution of the 
factors Z. Observe that conditioning on Z reduces to the problem of the independent case 
studied in the previous paragraph, with default probability p(Z) as defined in (|5,2|) . and 
c.g.f. F(.,Z) in (|5.11|) . Choose 6 qn (Z) > attaining the argmax in 9q n — T(9,Z), and 
return the estimator 

eM-O qn (Z)Ln + nT(e qn (Z),Z))l Ln > nqn , 

where L n is sampled with a default probability p(6 q (Z), Z) = p(Z)e eqn ^ / (1 — p(Z) + 
p(Z)e Sqn ( z ^). This provides an unbiased conditional estimator of f[L n > nq n \Z] and an 
asymptotically optimal conditional variance. We further apply IS to the factor Z ~ A/"(0, 1) 
under P, by shifting the factor mean to fi, and then considering the estimator 

exp(- U Z + i U 2 ) exp(-6 qn (Z)L n + nT(e qn (Z), Z))l Ln > nqn , (5.20) 

where Z is sampled from 1). To summarize, the two-step IS estimator is generated as 
follows : 

• Sample Z from JV(/x, 1) 

• Compute Qn (Z) and p(9 qn (Z), Z) 

• Return the estimator (|5.20p where L n is sampled with default probability p{9 qn (Z), Z). 

By construction, this provides an unbiaised estimator of ¥[L n > nq n ], and the key point 
is to specify the choice of \x in order to reduce the global variance or equivalently the 
second moment M 2 (fi, q n ) of this estimator. First, recall from Cauchy-Schwarz's inequality : 
M 2 (fj,,q n ) > (P[L n > nq]) 2 , so that the fastest possible rate of decay of M 2 (fi,q n ) is twice 
the probability itself : 

hminf T^—\nM 2 (q n ,fj,) > 2 lim ^— lnP[L n > nq n ]. (5.21) 
n— >oo m n n^oo In n 

To achieve this twice rate, we proceed as follows. Denoting by E the expectation under the 
IS distribution, we have 

M 2 (^q n ) = E[eM-^Z + i2 2 )exp(-29 qn (Z)L n + 2nT(6 qn (Z),Z))l Ln > nqn 
< E[exp(-2uZ + fi 2 ) exp(-2n6 qn (Z)q n + 2nT(9 qri (Z), Z)) 
E exp(-2uZ + u 2 + 2i^(Z))l, 
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by definition of 9 qn {Z) and F n (z) = — n sup g>0 [9q n — T(9, z)] introduced in the proof of the 
upper bound in Theorem 15.11 As in (|5.15j) . (I5,17h . by choosing /x = p n solution to F n (/j, n ) 
= fi n , we then get 

Mn(nn,q n ) < exp(2F n (/x„) - p 2 n ) < e xp(-/4), 
since F n is nonpositive. From (|5.19p . this yields 

limsup-^— lnM^(/i n ,g n ) < -2a- — ^— = 2 lim lnP[L n > nq n ], 

n-^oo inn /r n-*oomn 

which proves together with (|5.2ip that 

lim — !— \nMl(pL n ,q n ) = -2a- — ^— = 2 lim lnP[JD n > nq n ], 

n-»oo In n /r n-^oo In n 

and thus the estimator (|5.2(jp for the choice fi = {i n is asymptotically optimal. The choice 
of = z n also leads to an asymptotically optimal estimator. 

Remark 5.2 We also prove by similar methods large deviation results for the loss distri- 
bution in the limiting regime where individual loss probabilities decrease toward zero, see 
|26j for the details. This setting is relevant to portfolios of highly-rated obligors, for which 
one-year default probabilities are extremely small. This is also relevant to measuring risk 
over short time horizons. In this limiting regime, we set 

l n = nq, with < q < 1, p = p n = 0(e~ na ), with a > 0. 

Then, 

lim — lnP[L„, > nq] = ^, 

rwoo n ' p 

and we may construct similarly as in the case of large losses, a two-step IS asymptotically 
optimal estimator. 



5.2 A large deviations approach to optimal long term investment 
5.2.1 An asymptotic outperforming benchmark criterion 

A popular approach for institutional managers is concerned about the performance of their 
portfolio relative to the achievement of a given benchmark. This means that investors are 
interested in maximizing the probability that their wealth exceed a predetermined index. 
Equivalently, this may be also formulated as the problem of minimizing the probability 
that the wealth of the investor falls below a specified value. This target problem was 
studied by several authors for a goal achievement in finite time horizon, see e.g. [11] or 
|20j . Recently, and in a static framework, the paper [40 J considered an asymptotic version 
of this outperformance criterion when time horizon goes to infinity, which leads to a large 
deviations portfolio criterion. To illustrate the purpose, let us consider the following toy 
example. Suppose that an investor trades a number a of shares in stock of price S, and 
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keep it until time T. Her wealth at time T is then = aSx- For simplicity, we take a 
Bachelier model for the stock price : St = lit + crWt, where W is a brownian motion. We 
now look at the behavior of the average wealth when time horizon T goes to infinity. By 
the law of large numbers, for any a £ R, the average wealth converges a.s. to : 

X T := — = an + aa— — > a/x, 

when T goes to infinity. When considering positive stock price, as in the Black-Scholes 
model, the relevant ergodic mean is the average of the growth rate, i.e. the logarithm of 
the wealth. Fix some benchmark level Then, from Cramer's theorem, the probability 

of outperforming x decays exponentially fast as : 

in the sense that limT^oo 7 lnP[X^ > x] = —I(x, a), where 

I(x,a) = sup[9x — T(9, a)} 



9& 
1 



r(9,a) = -lnEfe^T]. 

Thus, the lower is the decay rate I(x,a), the more chance there is of realizing a portfolio 
performance above x. The asymptotic version of the outperforming benchmark criterion is 
then formulated as : 

sup lira ilnP[X?; > x] = - inf I(x, a). (5.22) 
In this simple example, the quantities involved are all explicit : 



T(6,a) = 6aii + 
I(x,a) = 



1 / afi—x 

2 1 OUT 



2 

2 



a / 
0, a = 0, x = 

00, a = 0, 1 / 0. 



The solution to (|5.22p is then given by a* = x/fi, which means that the associated expected 
wealth E[X^*] is equal to the target x. 

We now develop an asymptotic dynamic version of the outperformance management 
criterion due to [36]. Such a problem corresponds to an ergodic objective of beating a given 
benchmark, and may be of particular interest for institutional managers with long term 
horizon, like mutual funds. On the other hand, stationary long term horizon problems are 
expected to be more tractable than finite horizon problems, and should provide some good 
insight for management problems with long, but finite, time horizon. 

We formulate the problem in a rather abstract setting. Let Z = (X, Y) be a process 
valued in R x M d , controlled by a, a control process valued in some subset A of R 9 . We 
denote by A the set of control processes. As usual, to alleviate notations, we omitted the 
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dependence of Z = (X, Y) in a € A. We shall then study the large deviations control 
problem : 

v(x) = suplimsup^lnP[Xr > x], iGK, (5.23) 

where Xt = Xt/T. The variable X should typically be viewed in finance as the (logarithm) 
of the wealth process, Y are factors on market (stock, volatility ...), and a represents the 
trading portfolio. 

5.2.2 Duality to the large deviations control problem 

The large deviations control problem (15.230 is a non standard stochastic control problem, 
where the objective is usually formulated as an expectation of some functional to optimize. 
In particular, in a Markovian continuous-time setting, we do not know if there is a dy- 
namic programming principle and a corresponding Hamilton-Jacobi-Bellman equation for 
our problem. We shall actually adopt a duality approach based on the relation relating 
rate function of a LDP and cumulant generating function. The formal derivation is the 
following. Given a G A, if there is a LDP for X? = Xt/T, its rate function I(.,a) should 
be related by the Fenchel-Legendre transform : 

I(x,a) = sup[9x — T(8,a)], 
e 

to the c.g.f. 

T(6,a) = limsup^lnE[e eXT ]. (5.24) 

In this case, we would get 

v(x) = sup limsup — lnP[Xr > x] = — inf I(x,a) 

a&A T^oo T a€A 

= — inf sup[9x — T(6, a)], 
aeA 

and so, provided that one could intervert infinum and supremum in the above relation 
(actually, the minmax theorem does not apply since A is not necessarily compact and a — > 
6x — T(9, a) is not convex) : 

v(x) = -sup[0x-r(0)], (5.25) 
e 

where 

r(0) = supT(0,a) = sup limsup^ In E[e eXT }. (5.26) 

aeA a^A T-^oo T 

Problem (|5.26p is the dual problem via (|5.25p to the original problem (|5.23p . We shall 
see in the next section that (|5.26p can be reformulated as a risk-sensitive ergodic control 
problem, which is more tractable than (15.23H and is studied by dynamic programming 
methods leading in some cases to explicit calculations. 
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First, we show rigorously the duality relation between the large deviations control prob- 
lem and the risk-sensitive control problem and how the optimal controls to the former one 
are related to the latter one. This result may be viewed as an extension of the Gartner-Ellis 
theorem with control components. 

Theorem 5.2 Suppose that there exists 9 G (0, oo] such that for all 9 E [0,9), there exists 
a solution a(9) G A to the dual problem T(9), with a limit in (|5.24p . i.e. 

T{9) = lim ^lnE exp (8X* (e) ^ . (5.27) 

Suppose also that T(9) is continuously differentiable on [0,9). Then for all x < T'(9) := 
lim A/ r(6»), we get 

v(x) = - sup [9x-T(9)]. (5.28) 

ee[o,e) 

Moreover, the sequence of controls 



& t {9[x + l)), T'{0)<x<Y'{9) 
at(e[T'(0) + ±)), x<T'(0), 



with 9(x) G (0,6*) s.t. T'(9(x)) = x G (T'(0),T'(9)) ) is nearly optimal in the sense that 

= v(x). 



lim lim sup — In ] 

T->oo T 



Xf' n > x 



Proof. 

Step 1. Let us consider the Fenchel-Legendre transform of the convex function V on [0,9) : 

F*(x) = sup [9x-T(8)], iel. (5.29) 
ee[o,S) 

Since T is C 1 on [0, 9), it is well-known (see e.g. Lemma 2.3.9 in |13j ) that the function T* 
is convex, nondecreasing and satisfies : 

= t «.*■>* - m*)), .f Ho) < x < : HS) 

o, if x < r (0), 



9(x)x-T*(x) > 9(x)x' -T*(x'), vT'(0) < x < T'(9), Vx' ^ x, (5.31) 

where 9{x) G (0,9) is s.t. V(9(x)) = x G (r'(0), T'(8)). Moreover, T* is continuous on 
{-<*>,?(§)). 

Step 2 : Upper bound. For all x G R, a G .A, an application of Chebycheff 's inequality 
yields : 

¥[X T >x] < exp(-6xT)R[exp(6X T )], V0€[O,§), 
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and so 



limsup^lnP[X T > x] < -9x + limsup - lnE[exp(0X T )], V0e[O,0). 

T~*oo T T-*oo T 

By definitions of T and F* , we deduce : 

sup limsup^ In P [X$ > x] < -T*(x). 



(5.32) 



Step 3 : Lower bound. Given x < T'(9), let us define the probability measure QS, on (0, Tt) 
via : 



exp 



(5.33) 



(x n )x% ' -r T (e( Xn ),a*> n ) 

where x n = x + 1/n if x > r'(0), x n = r'(0) + 1/n otherwise, a*' n = a(9(x n )), and 

T T (9,a) = lnE[exp(0X£)], 6 £ [0,9), a £ A. 

Here n is large enough so that x + 1/n < T'(9). We now take e > small enough so that 
x < x n — e and x n + e < T'(9). We then have : 



T 



s„ - £ < I? ' < x n + e 
dP 



-1 



{* n - 6 <j^*' n <*„+ e } a VT 



1 



> -9(x n )(x n + e) + -T T (9(x n ),a*> n ) 



T 

x n - e < Xtf'" < x n + e 



where we use (|5.33p in the last inequality. By definition of the dual problem, this yields : 



liminf-lnP[X£ ' >x] > -9(x n ) {x n + e) +T{9(x n )) 

T— >oo 1 



+ liminf — ln<Q>& 



> -r*(x n ) 



+ liminf — lnQ£ 

T—fOC T 



x n - e < X T < x n + e 



x n - e < Xt < x n + e 



(5.34) 



where the second inequality follows by the definition of T* (and actually holds with equality 
due to (|5.30p ). We now show that : 



lim inf — In Q T 



x n - e < Xt ' < x n + e 



0. 



(5.35) 



Denote by fj the c.g.f. under Q£ of Xf'". For all ( £ R, we have by (Q3|) : 

f T (0 := ln^[exp(CXf'")] 

= T T (9{x n ) + C, «*' n ) - T T {9(x n ), a*> n ). 
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Therefore, by definition of the dual problem and (|5.27p . we have for all C G [—9(x n ),9 

0(Xn)) : 



limsup^fj(C) < r(0(x n ) + C) - r(0(x n )). 



(5.36) 



As in part 1) of this proof, by Chebycheff's inequality, we have for all £ £ [0,0 — 9{x n )) : 



lim sup — In QJ 



>x n + e 



< -((x n + e) + limsup^fj(C) 



< -c(x n + e) + r(c + 0(x n ))-r(0(x n )), 

where the second inequality follows from (|5,36p . We deduce 



lim sup — In Q J 



X% > x n + e 



< -sup{C(x n + e)-r(C) :(€[«(4fl)} 

-r(0(x n ))+0(x n )(x n + e) 

< -r* (x n + e) - T(9(x n )) + 0(x n ) (x n + e) , 

= -r*(x n + e) + r*(x n ) + e0(x n ), (5.37) 

where the second inequality and the last equality follow from (I5.30p . Similarly, we have for 
all C G [-6(xn),0] : 



lim sup — In Q J 



< -C(x„-e) + limsup^f^(C) 

< -C{x n -e)+T(9{x n )+0-T{e(x n )), 



and so : 



lim sup — In Q J 



Xt 



<x n -e 



By (071) - (08| ). we then get : 



lim sup — In Q J 
1 



< -sup{C(x n -e)-r(C) :CG [O,0(x n )]} 

-r(0(z n )) + AW (sn - e) 

< -r*(x n -e)+r*(0(x n ))-£0(x n ). (5.38) 



M 



< max { lim sup — In Q J 



Xf' n >x n + e\ 



1 



; lim sup - In Q J 



T x n ~\~ £ 

< max{-r* (x n + e) + r* (x n ) + e0(x n ); — r* (x n - e) + T*(0(x„)) - £0(x„)} 

< 0, 

where the strict inequality follows from (|5.3ip . This implies that QJ[{X^ ' < x n — e} U 
{Xj, ' > x n + e}] -> and hence QJ[x n - e < X% ' < x n + e] -> 1 as T goes to infinity. 
In particular (|5.35p is satisfied, and by sending e to zero in (|5,34p . we get : 



liminf-lnP[X£ > x] > ~r*(x n ). 
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By continuity of T* on (— oo, T'(9)), we obtain by sending n to infinity and recalling that 
T*(x) = o = r*(r'(o)) for x < r'(0) : 

1 



liminfliminf-lnP[X£ ' > x] > -T*(x). 
This last inequality combined with (|5.32p ends the proof. 



□ 



Remark 5.3 Notice that in Theorem 15.21 the duality relation (|5.28p holds for x < T'(9). 
When T'(9) = oo, we say that fonction T is steep, so that (|5.28p holds for all x G R. We 
illustrate in the next section different cases where V is steep or not. 

Remark 5.4 In financial applications, Xt is the logarithm of an investor's wealth V t a at 
time t, at is the proportion of wealth invested in q risky assets S and Y is some economic 
factor influencing the dynamics of S and the savings account S°. Hence, in a diffusion 
model, we have 



dX t 



r(Y t ) + a'MY t ) - r(Y t )e q 



dt + a' t $(Y t )dW t , 



where /i(y) (resp. ^(y)) is the rate of return (resp. volatility) of the risky assets, r{y) is 
the interest rate, and e q is the unit vector in R 9 . 

Notice that the value function of the dual problem can be written as : 



r(0) 



lim — In E 

T^oc T 



Me) 



where Uq(c) = c e is a power utility function with Constant Relative Risk Aversion (CRRA) 
1 — 9 > provided that 9 < 1. Then, Theorem 15.21 means that for any target level x, the 
optimal over performance probability of growth rate is (approximately) directly related, for 
large T, to the expected CRRA utility of wealth, by : 



P[Xf > x] 



E 



U, 



9(x) 



V; 



T 



(x)xT 



(5.39) 



with the convention that 9{x) = for x < r'(0). Hence, 1 — 9{x) can be interpreted as a 
constant degree of relative risk aversion for an investor who has an overperformance target 
level x. Moreover, by strict convexity of function T* in (|5.29p . it is clear that 9{x) is strictly 
increasing for x > r'(0). So an investor with a higher target level x has a lower degree 
of relative risk aversion 1 — 9{x). In summary, Theorem 15.21 (or relation (|5.39p ) inversely 
relates the target level of growth rate to the degree of relative risk aversion in expected 
utility theory. 



5.2.3 Explicit calculations to the dual risk-sensitive control problem 

We now show that the dual control problem (|5.26p may be transformed via a change of 
probability measure into a risk-sensitive control problem. We consider the framework of a 
general diffusion model for Z = (X, Y) : 

dX t = b(X t ,Y t ,a t )dt + a(X t ,Y t ,a t )dWt in R (5.40) 
dY t = r](X t ,Y t ,a t )dt + a(X t ,Y t ,a t )dW t in R d , (5.41) 
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where W is a m-dimensional brownian motion on a filtered probability space (Q,J-,¥ = 
(^t)t>0, P) j and a = (at)i>o, the control process, is F-adapted and valued in some subset 
A of M q . We denote A the set of control processes. The coefficients b, r/, a and 7 are 
measurable functions of their arguments, and given a £ A and an initial condition, we 
assume the existence and uniqueness of a strong solution to (|5.4Up - (|5.41|) . which we also 
write by setting Z = (X, Y) : 



dZ t 



B(Z t ,at)dt + E(Z t ,at)dW t . 



(5.42) 



From the dynamics of X in (|5.40p , we may rewrite the Laplace transform of Xt as : 



E [exp [BXt)\ 



jx 



E 



expfaf b(Z t ,a t )dt + 9 a{Z t ,a t )dW t 



1,Z t ,a t )dt 



where 



(5.43) 



,z,a) 



0b(z,a) + — \a{z,a)\ 2 , 



and £t(9) is the Doleans-Dade exponential local martingale 



£ 



exp 



a(Z u ,a u )dW u 

J a{Z u ,a u )dW u - y J \a(Z u ,a u )\ 2 dv)j , t > 0. (5.44) 



If £ a (#) is a "true" martingale, it defines a probability measure Q under which, by Girsanov's 
theorem, the dynamics of Z is given by : 



where 



dZ t = G(e,Z u a t )dt + Z(Z u a t )dWy, 
is a O-Brownian motion and 



G(9,z,a) 



b(z,a) + 9\cr(z,a)\ 2 
r)(z, a) + 9 n fa'(z, a) 



Hence, the dual problem may be written as a stochastic control problem with exponential 
integral cost criterion : 

fT 



T(9) = sup lim sup — In E 8 " 



exp 



), Z t ,a t )dt 



, 9>0. 



(5.45) 



For fixed 9, this is an ergodic risk-sensitive control problem which has been studied by 
several authors, see e.g. [18], [10] or [39] in a discrete-time setting. It admits a dynamic 
programming equation : 



A(9) 



sup 



itr (SS'(z, a)D 2 <p e ) + s, a).V<^ 



, ze 



(5.46) 
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The unknown is the pair (A(0), <pg) eRx C 2 (R rf+1 ), and A(0) is a candidate for T(9). The 
above P.D.E. is formally derived by considering the finite horizon problem 



T 

exp ( / £(9,Z t ,a t )dt 



ug(T,z) = sup 

by writing the Bellman equation for this classical control problem and by making the 
logarithm transformation 

\nu e {T,z) ~ A(0)T + <f> e (z), 

for large T. 

One can prove rigorously that a pair solution (A(0),0#) to the PDE (|5.46p provides 
a solution A(0) = T(6) to the dual problem (|5.26p . with an optimal control given by the 
argument max in (|5,46p . This is called a verification theorem in stochastic control theory. 
Actually, there may have multiple solutions cpg to (|5,46p (even up to a constant), and we 
need some ergodicity condition to select the good one that satisfies the verification theorem. 
We refer to [37] for the details, and we illustrate our purpose with an example with explicit 
calculations. 

We consider a one-factor model where the bond price S° and the stock price S evolve 
according to : 

dS® dS 

-^f = (a + b Y t )dt, = (a + bY t )dt + adW t , 

with a factor Y as an Ornstein-Uhlenbeck ergodic process: 

dY t = -kY t dt + dB t , 

where ao, bo, a, b are constants, k, a are positive constants, and W, B are two brownian mo- 
tions, supposed non correlated for simplicity. This includes Black-Scholes, Platen-Rebolledo 
or Vasicek models. The (self-financed) wealth process Vt with a proportion at invested in 
stock, follows the dynamics : dVt = atVt^- + (1 — at)Vt~^-, and so the logarithm of the 
wealth process Xt = In 14 is governed by a linear-quadratic model : 

dX t = (0 O Y 2 + (3 x a 2 t + (3 2 Y t a t + (3 3 Y t + /3 4 « 4 + P 5 )dt + (6 Y t + 5 x a t + 5 2 )dW&A7) 

where in our context, (3o = 0, 0\ = —a 2 /2, (5 2 = b — bo, (3s = bo, /?4 = a — ao, 05 = ao, 5o = 
0, 8% = a and 5 2 = 0. Without loss of generality, we may assume that a = 1 and so 0\ = 
— 1/2 (embedded into a) and (3$ = (embedded into x). The P.D.E. (I5.46P simplifies into 
the search of a pair (A(0), </>#) with 4>g depending only on y and solution to : 



A(0) = \<Pe-ky ( l ) ' e + ^' e \ 2 + 9^o + e^)y 2 + 9((3 3 + 98o5 2 )y + 



' ° m + 08o8 1 )y + (3 4 + 95 1 8 2 } 2 . (5.48) 



2 1 - 95\ 

Moreover, the maximum in a G M of (|5.46p is attained for 



<x(P,y) = 1-95* ' ^ ^ 



47 



The above calculations are valid only for < 9 < 1/5 2 . We are looking for a quadratic 
solution to the ordinary differential equation (|5.48p : 

Mv) = ^A(9)y 2 + B(9)y. 

By substituting into (|5.48p . and cancelling terms in y 2 , y and constant terms, we obtain 

• a polynomial second degree equation for A(9) 

• a linear equation for B{9), given A{8) 

• A(#) is then expressed explicitly in function of A{6) and B{9) from (|5.48[) . 

The existence of a solution to the second degree equation for A(9), through the nonnegativ- 
ity of the discriminant, allows to determine the bound 6 and so the interval [0, 6) on which 
A is well-defined and finite. Moreover, we find two possible roots to the polynomial second 
degree equation for A(9), but only one satisfies the ergodicity condition. From Theorem 
15.21 we deduce that 

v(x) = - sup [9x-A(9)), Va?<A'(0), (5.50) 
ee[o,S) 

with a sequence of nearly optimal controls given by : 

*,n _ / a(9(x + ±),Yt), A'(0) < x < A'(0) 
\ a(0(A'(O) + ±),F t ), x<A'(0), 

with 9(x) £ (0, 9) s.t. A'(9(x)) = x. In the one-factor model described above, the function A 
is steep, i.e. A' (9) = oo, and so (|5.50p holds for all i£K. For example, in the Black-Scholes 
model, i.e. bo = b = 0, we obtain 

r(«0 = A(O) = \-^ e (^p)\ **e<B = i, 



v(x) = - sup [9x-F(9)] 



-(y/E - v^) 2 , if x > x := r (0) = 



ee[o,i) ' 1 °) if x < x, 



9(x) = 1 — \Jxjx if x > x, and otherwise, and 



y2x, if x > x 
if x < x. 



a—ao 



We observe that for an index value x small enough, actually x < x, the optimal investment 
for our large deviations criterion is equal to the optimal investment of the Merton's problem 
for an investor with relative risk aversion one. When the value index is larger than x, the 
optimal investment is increasing with x, with a degree of relative risk aversion 1 — 9{x) 
decreasing in x. 

In the more general linear-quadratic model (|5.47p . A may be steep or not depending on 
the parameters Pi and Si. We refer to [37] for the details. Some variants and extensions of 
this large deviations control problem are studied in [30] and pQ. 
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6 Conclusion 



In these notes, we developed some applications and emphasized methods of large deviations 
in finance and insurance. These applications are multiple, and our presentation is by no 
means exhaustive. There are numerous works dealing with large deviations techniques in 
the context of insurance, see e.g. [H], [I], or more recently [33J and [33]. We also cite 
the paper [6] , which develops asymptotic formula for calculating implied volatility of index 
options. Large deviation principle for backward stochastic differential equations is used by 
[5J in a setting motivated by credit risk management. Other papers using large deviations in 
portfolio management are [38] and [8]. Some aspects of large deviations applied to problems 
in macroeconomics are studied in |41j . 

From a general viewpoint, questions related to extremal events are embedded into the 
extreme value theory, and we refer to the classical book [16] for a development of this 
subject, especially regarding applications in finance and insurance. 
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